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ABSTRACT 

This  thesis  is  concerned  with  the  modelling  and  performance  analysis 
of  systems  which  consist  of  concurrently  acting  components,  an  example  of 
which  is  an  asynchronous  pipelined  processor.  The  work  is  divided  into  two 
parts . 

In  the  first  part,  a  suitable  model  is  developed  for  describing  the 
structure  of  asynchronous  concurrent  systems.  In  conventional  automata 
theory,  the  finite— state  machine  model  is  used  to  describe  the  behavior 
of  systems;  the  problem  with  this  is  that  a  large  number  of  itates  re¬ 
sults  when  practical  systems  are  modelled.  In  this  thesis,  each  system 
component  is  modelled  as  a  finite-state  machine,  and  a  system  is  viewed 
as  an  ensemble  of  interconnected  finite-state  machines .This  has  the  ad¬ 
vantage  that  the  size  of  a  system  model  grows  linearly  rather  than  expo¬ 
nentially  with  the  number  c  system  components.  A  subclass  of  Petri  nets 
known  as  SMD  (State  Machine  Decomposable)  Petri  nets  is  identified  in 
order  to  formalize  the  notions  of  finite-state  machines  and  their  inter¬ 
connection.  For  convenience,  systems  of  interest  are  divided  into  two  broad 
categories : 

(a)  Deterministic,  or  decision-free. 

(b)  Non-deterministic,  or  systems  with  decisions. 

SMD  Petri  nets  are  used  to  model  both  classes  of  systems;  in  addition,  a 
subclass  of  Petri  nets  known  as  LSP  Petri  nets  is  used  to  model  those 
deterministic  systems  that  cannot  be  modelled  by  SMD  Petri  nets. 

The  second  part  of  the  thesis  is  concerned  with  finding  the  computation 
rate  of  activities  in  real-world  asynchronous  concurrent  systems.  Practi¬ 
cal  systems  are  constructed  from  devices  which  have  a  finite  speed  of  ope¬ 
ration.  Since  Petri  nets  do  not  have  time  parameters  as  part  of  their 
definition,  they  can  model  the  structure  of  systems  but  cannot  be  used  to 
study  their  computation  rate.  The  definition  of  Petri  nets  is  augmented 
to  model  the  speed  of  operation  of  a  device  in  a  system  by  assuming  that 
the  corresponding  activity  in  the  Petri  net  has  a  finite,  non-zero  time 
duration.  The  resulting  nets  are  termed  timed  Petri  nets,  and  methods  are 
given  for  finding  the  computation  rate  of  activities  in  timed  SMD  and  LSP 
Petri  nets.  The  results  are  applied  to  the  analysis  of  several  asynchro¬ 
nous  systems  drawn  from  areas  within  and  outside  the  domain  of  computer 
systems  . 

THESIS  SUPERVISOR:  Jack  B.  Dennis 
TITLE:  Professor  of  Electrical  Engineering 
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CHAPTER  1 


INTRODUCTION 


1.1  Background 

Tho  philosophy  of  this  thesis  can  be  described  very  effectively  by  the 
following  quotation  from  T.C.  Chen  [C2j: 

In  the  quest  for  performance  above  and  beyond  that  deliverable 
by  hardware  componentry,  two  alternative  multiprocessing  ap¬ 
proaches  to  computer  design  can  be  taken.  One  is  to  subdivide 
each  oncoming  joh  among  many  identically  constructed  mechanisms, 
and  is  commonly  referred  to  as  parallelism.  The  other  alternative 
is  to  develop  a  collection  of  specialized  mechanisms  capable  of 
working  simultaneously  to  form  a  general  purpose  organization; 
this  is  commonly  called  over  lap ,  and  an  extreme  form  of  this  is 
called  pipelining. 

It  is  well  known  that  present  day  machines  are  very  wasteful  in  terms  of 
resource  utilization.  One  of  the  original  arguments  for  time-shared 
multiprogrammed  systems  was  the  fact  that  they  could  permit  better  utiliza¬ 
tion  of  system  resources  than  a  batch-processing  system  by  overlapping  che 
operation  of  the  processor,  primary  storage  and  secondary  storage.  The 
processor  in  contemporary  computer  systems  is  treated  as  a  resource  unit 
which  is  allocated  to  a  user  job  or  a  task  within  a  user  job.  Each 
processor  typically  consists  of  smaller  processing  units  like  adders, 
multipliers,  an  instruction  fetch  unit  and  an  instruction  decode  unit, 
all  of  which  can  potentially  be  operated  concurrently  with  each  other. 

If  it  were  possible  to  achieve  this  degree  of  concurrency,  a  much  greater 
processing  rate  could  be  realized  as  a  consequence.  Of  course,  the  over- 


all  throughput  of  the  machine  depends  not  only  on  the' instruction  pro¬ 
cessing  rate  of  the  processor,  but  also  on  factors  like  the  speed  of 
the  main  memory,  its  degree  of  interleaving,  the  type  of  secondary 
storage  and  finally  on  issues  like  the  job  mix  and  the  scheduling  strategy 
used.  But  by  suitably  reconfiguring  main  and  secondary  memory,  the  bene¬ 
fits  of  added  concurrency  in  the  processor  could  be  realized  as  added 
throughput  in  the  overall  system.  In  most  contemporary  processors,  over¬ 
lapped  operation  of  functional  units  within  the  processor  is  restricted 
to  concurrent  fetching  and  decoding  of  instructions.  Many  systems  can  be 
run  in  a  multiprocessor  configuration  so  that  real  parallelism  is  possible. 
However,  this  parallelism  is  between  user  jobs  or  between  different  tasks 
of  the  same  user  job,  a  situation  which  we  term  macro -parallelism.  Dennis  [D3] 
has  advocated  a  computer  organization  in  which  parallel  operation  is  pos¬ 
sible  right  down  to  the  level  of  instructions  in  a  computer  program  or  a 
user  task.  We  term  this  micro -parallelism  and  it  is  clearly  impractical 
on  contemporary  machines  because  of  the  overhead  involved  in  switching  the 
processor  between  instruction  streams. 

Jack  Dennis  has  looked  at  the  issues  involved  in  designing  a  computer 
system  to  support  r  icro-parallelism,  and  readers  interested  in  his  pro¬ 
posals  for  a  memory  organization  and  a  representation  scheme  for  programs 
and  information  structures  are  referred  to  hi s  paper  [D3] .  We  quote  his 
remarks  about  the  processing  hardware: 

The  organization  of  the  processing  hardware  is  intended  to  per¬ 
mit  extensive  sharing  of  multiple  specialized  cells  by  many 
computations  to  ensure  statistically  high  utilization.  It  is 
envisioned  that  there  be  tens  to  hundreds  of  units  of  each  cell 
class,  operating  independently  and  asynchronously  using  a  ser¬ 
vice  on  demand  principle  of  control. 
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The  control  needed  in  such  a  large  complex  system  would  be  too  formidable 
to  tackle  with  a  centrally  clocked  or  a  synchronous  organization.  Dennis 
has  ,*ightly  pointed  out  the  need  for  asynchronous  operation  of  such  a 
system.  Each  "cell"  in  Dennis'  system  corresponds  to  a  functional  unit 
of  a  certain  type,  like  an  adder  or  an  instruction  decoder,  and  the  opera¬ 
tion  of  a  cell  type  will  be  overlapped  or  pipelined  with  the  operation  of 
other  cell  types.  Thus,  by  a  combination  of  parallelism  and  overlap,  a 
statistically  high  instruction  throughput  should  result.  We  will  refer  to 
such  systems  as  "asynchronous  concurrent  systems"  and  will  have  occasion 
to  use  this  term  often  in  the  course  of  the  thesis.  Dennis  and  Patil  [Dl, 
D2]  have  addressed  themselves  to  the  problem  of  evolving  tools  for  the 
description  and  implementation  of  such  systems.  They  have  made  much 
progress,  and  their  efforts  have  resulted  in  a  systematic  technique  for 
both  the  description  of  asynchronous  concurrent  systems  together  with  a 
methodology  for  their  realization  using  a  set  of  elementary  modules  [Dl, 

D2]  .  In  this  thesis,  we  would  like  to  address  ourselves  to  the  question 
of  modelling  such  systems  with  a  view  to  predicting  their  performance. 

The  performance  of  an  asynchronous  processor  such  as  visualized  by  Dennis 
will  depend  upon  several  factors,  some  of  which  are  listed  below: 

a)  The  number  of  functional  units  of  each  type. 

b)  The  speed  of  operation  of  each  functional  unit. 

c)  Statistical  properties  of  user  jobs,  e.g.,  their  degree  of 
parallelism,  relative  frequencies  of  the  different  instructions. 

d)  Gross  statistical  properties  of  user  jobs,  i.e.,  job  arrival 
rates,  mean  length  of  user  jobs. 

Before  we  can  incorporate  these  factors  into  a  model  for  performance 
evaluation  of  asynchronous  computer  organizations,  we  must  come  up  with  a 


suitable  descriptive  tool  or  model  which  can  represent  the  structure  of 
such  systems  in  a  manner  that  is  both  simple  and  accurate.  In  particular, 
factors  like  (a)  and  (b)  should  be  readily  incorporated  into  the  model. 

Models  for  asynchronous  systems  as  they  currently  stand  can  successful¬ 
ly  describe  the  structure  of  asynchronous  systems,  i.e.,  the  structure 
and  interconnection  of  the  parts  of  the  system.  However,  they  do  not 
have  a  way  of  incorporating  information  about  factors  like  (b)  and  (c) 
as  part  of  their  definition.  In  this  thesis,  we  have  developed  a  model 
in  which  factors  (a),  (b>,  and  (c)  can  be  described  in  a  natural  and 
simple  way.  Thus,  given  a  description  of  the  structure  of  the  system 
and  the  speed  of  operation  of  its  parts  together  with  statistics  on  the 
utilization  each  part,  we  can  obtain  a  measure  of  the  throughput  of  the 
system.  If  the  system  we  are  considering  is  an  asynchronous  pipelined 
processor,  we  can  obtain  a  measure  of  its  processing  rate,  given  that  we 
have  information  of  types  (a),  (b)  and  (c)  available  to  us.  The  actual 
throughput  of  such  a  processor  when  connected  to  memory  units  and  in  the 
presence  of  user  jobs  is  not  easy  to  find.  We  will  not  study  issues  of 
type  (d)  in  connection  with  the  performance  of  asynchronous  computer 
systems,  but  will  concern  ourselves  with  finding  an  index  of  performance 
which  we  will  call  its  information  processing  capacity  or  computation 
rate . 

Our  approach  will  be  to  study  an  existing  model  for  asynchronous  sys¬ 
tems  and  explore  in  depth  its  applications  to  the  modelling  of  various 
types  of  concurrent  systems,  including  pipelined  organizations  in  whicu 
parallelism  may  be  present  in  each  stage.  This  model,  while  adequate  for 
describing  the  structure  of  asynchronous  systems,  does  not  contain  in¬ 
formation  regarding  the  speed  of  operation  of  system  components  or  any 
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information  about  the  statistical  utilization  of  the  parts  of  a  system. 
We  show  how  such  information  can  be  incorporated  into  the  model.  The 
properties  of  this  augmented  model  are  studied,  and  a  technique  is  given 
for  analyzing  the  throughput  rate  of  a  large  variety  of  asynchronous 
processing  or  computing  systems.  The  model  we  have  developed  and  the 
analysis  techniques  for  it  were  motivated  by  the  desire  to  study  perfor¬ 
mance  issues  in  asynchronous  computer  systems .  However,  the  work  can  be 
applied  with  equal  facility  to  analyzing  numerous  systems  outside  the 
realm  of  computer  systems;  the  best  example  is  that  of  an  assembly  line, 
and  we  shall  interchangeably  talk  about  pipelined  processing  systems  or 
assembly  lines,  because  both  share  fundamental  characteristics  such  as 
overlapped  and  parallel  operation.  In  the  next  section,  we  pursue  the 
modelling  of  asynchronous  systems  in  some  depth,  and  illustrate  the 
spirit  and  flavor  of  the  thesis  by  concrete  examples. 
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1.2  Modelling  Asynchronous  Systems 

Our  study  of  various  types  of  asynchronous  concurrent  systems  has  led 
us  to  the  conclusion  that,  by  and  large,  we  would  like  to  distinguish 
between  two  broad  classes  of  systems: 

a)  deterministic 

b)  non-deterministic  (i.e.,  having  decisions). 

Let  us  explain  briefly  what  we  mean  by  this  distinction. 

Deterministic  Systems :  We  will  call  an  asynchronous  system  deterministic 
if  during  the  course  of  operation  of  the  system,  there  is  never  a 
situation  when  a  decision  has  to  be  made  between  alternative  courses  of 
action  in  the  system.  An  example  will  explain  what  we  mean.  Consider 
an  automobile  assembly  line  in  which  only  one  kind  of  car  is  being  manu¬ 
factured,  and  each  car  is  made  from  similar  components  using  the  same 
sequence  of  assembly  operations.  Thus,  in  the  course  of  operation  of 
this  system  every  assembly  operation  is  needed  for  the  assembly  of  each 
car,  there  being  no  difference  in  the  sequence  of  steps  needed  for  the 
manufacture  of  the  fourth  or  eighth  car  output  by  the  assembly  line.  To 
give  another  example,  consider  a  pipelined  floating-point  adder,  arranged 
in,  say,  three  stages  (see  Fig.  1.2.1).  Each  stage  performs  a  certain 
operation  on  the  pair  of  operands  input  to  the  pipeline  for  addition. 

Thus,  each  operand  pair  that  is  absorbed  at  the  input  goes  through  the  same 
sequence  of  operations  before  being  output  as  a  result.  Such  a  pipelined 
system  is  also  a  deterministic  system  because  no  choice  has  to  be  made 
between  alternative  courses  of  action.  So  far  we  have  looked  at  systems 
in  which  the  objects  being  assembled  (or  added,  etc.)  go  through  an  i- 
dentical  sequence  of  operations.  A  more  general  class  of  systems  consists 
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of  those  systems  in  which  the  objects  being  processed  do  not  necessarily 
go  through  the  same  sequence  of  operations,  but  the  operation  of  the 
system  is  still  deterministic.  An  example  of  such  a  system  is  an  assembly 
line  for  the  manufacture  of  automobiles  which  alternately  turns  out  two 
kinds  of  autos,  one  fitted  with  a  200  HP  engine  and  the  other  with  a  350 
HP  engine.  We  shall  assume  that  the  car  with  the  more  powerful  engine  has 
to  be  suitably  braced  and  fitted  with  extra  gadgetry,  so  that  after  the 
engine  has  been  bolted  to  the  chassis,  subsequent  assembly  operations  per¬ 
formed  on  the  two  cars  are  different.  Thus,  even  though  there  is  a  point 
in  the  system  at  which  a  decision  has  to  be  made,  this  decision  cannot, 
for  example,  be  made  arbitrarily  by  an  assembly  line  operator.  The  opera¬ 
tion  of  the  system  is,  so  to  speak,  "preordained,"  i.e.,  there  is  no  point 
during  the  operation  of  the  system  at  which  a  choice  can  be  exercised  be¬ 
tween  alternative  courses  of  action. 

Let  us  now  move  on  to  the  class  of  systems  we  have  termed  non-determin- 

istic  systems. 

Non- Deterministic  Systems:  We  will  loosely  define  non-deterministic 
systems  as  those  in  which  there  does  exist  a  choice  between  alternative 
courses  of  action.  An  example  of  such  a  system  is  the  assembly  line  of 
Figure  1.2.2,  minus  the  restriction  that  the  two  types  of  automobiles  be 
manufactured  alternately.  Thus,  at  the  bay  at  which  alternate  routing 
of  che  two  types  of  partially  assembled  autos  was  done,  this  routing  can 
now  be  made  random  or  controlled  by  some  decision  process  other  than_a 
purely  deterministic  one.  The  decision  could,  for  example,  be  made  on 
the  basis  of  up-to-the-minute  customer  orders  received  from  car  dealers. 
The  operation  of  an  assembly  line  such  as  this  is  said  to  be  non-determin- 
istic.  Let  us  now  give  a  more  computer-related  example.  Consider  the 
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simolified  model  of  a  pipeline  processor  shown  In  figure  1.2.3. 


Processing  for  Type  A  Output 


Figure  1.2.3.  A  Simplified  Model  of  a  Pipeline  Processor  for  a  Computer 
with  Three  Instruction  Types. 


We  shall  assume  that  the  instruction  set  of  the  computer  has  three  types 
of  instructions,  termed  types  A,  B,  and  C.  Instructions  are  fetched  at 
Stage  1,  decoded  at  Stage  2  and  processed  according  to  their  type  at 
Stages  3  and  4.  The  reader  will  notice  that  we  have  chosen  the  3ame  nota¬ 
tion  for  representing  both  the  assembly  line  of  figure  1.2.2.  and  the 
pipelined  processor  of  figure  1.2.3.  Both  these  types  of  systems, 
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together  with  a  large  variety  of  others,  all  have  the  basic  characteristic 
that  they  consist  of  a  set  of  parts  together  with  some  mechanism  for  coor¬ 
dinating  their  operation.  In  addition,  the  system  parts  could  be 

arranged  to  exhibit  overlapped  operation  and  parallelism.  Let  us  ex¬ 
plain  these  two  terms  in  the  context  of  Figure  1.2.3.  Each  stage  of  the 
system  is  a  part ,  whose  operation  can  be  overlapped  with  that  of  the 
other  parts.  We  will  say  that  the  parts  of  the  pipeline  processor  operate 
concurrently  or  in  an  overlapped  fashion.  We  now  come  to  parallelism. 
Suppose  Stage  1  consists  of  two  identical  hardware  units,  each  of  which 
can  independently  fetch  an  instruction,  and  the  two  can  operate  concur¬ 
rently  with  respect  to  each  other.  Stage  1  will  be  said  to  have  paral¬ 
lelism  of  degree  2.  In  general,  any  stage  ir  said  to  have  parallelism 
of  degree  n  if  it  has  n  physical  hardware  units  available  for  processing. 

Brief  Statement  of  Thesis  Problem:  The  problem  we  have  addressed  in 
this  thesis  is  to  study  in  some  depth  how  to  model  asynchronous,  concur¬ 
rent  systems  such  as  the  ones  shown  above.  We  have  chosen  a  formalism 
known  as  Petri  nets  [HI,  H2]  to  express  the  sequencing  relationships  be¬ 
tween  events  in  asynchronous  systems.  The  problem  with  Petri  nets,  as 
with  other  models  for  asynchronous  systems  or  parallel  computation  that 
we  could  have  used  is  that  they  represent  only  the  sequencing  or  "cause- 
and-effect"  relationships  between  events  in  a  system.  Such  a  systems 
description  is  not  adequate  if  we  wish  to  study  performance  issues. 

For  example,  the  assembly  lines  and  the  pipeline  processors  modelled  in 
Figures  1.2.1.  through  1.2.3.  are  all  real  world  systems  built  from  com¬ 
ponents  or  devices  which  take  a  certain  amount  of  time  to  operate.  Thus, 
the  production  of  cars  or  the  processing  of  instructions  in  an  actual  sys¬ 
tem  is  not  instantaneous.  The  rate  at  which  processed  objects  appear  at 
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the  output  depends  upon  several  factors,  some  of  which  we  outlined  in 
section  1.1  in  connection  with  an  asynchronous  pipelined  processor. 

Recapitulating,  the  throughput  rate  of  an  asynchronous  processing  system 
will  depend  upon  two  kinds  of  factors: 

(a)  factors  intrinsic  to  the  system,  like  its  structure,  the 
organization  of  its  parts  and  the  speed  of  operation  of  each  part. 

(b)  extrinsic  factors,  like  availability  of  items  in  the  input  queues, 
statistics  on  the  type  of  items  input  to  the  system  for  processing. 

The  only  extrinsic  factor  we  will  model  will  be  statistics  on  the  out¬ 
come  of  decisions  during  the  course  of  operation  of  the  system.  In 
connection  with  Figure  1.2.3,  this  would  mean  the  relative  probabilities 
of  occurrence  of  each  of  the  instruction  types  A,  B  and  C  during  the 
operation  of  the  pipelined  processor.  As  the  reader  will  note,  this 
is  equivalent  to  the  relative  frequency  of  use  of  the  system  parts  which 
perform  the  processing  of  these  instruction  types.  We  will  not  model 
other  extrinsic  factors,  like  arrival  statistics  for  items  in  the  input 
queues.  The  throughput  rate  that  we  thus  calculate  for  a  processing 
system  will  represent  the  maximum  rate  possible,  assuming  that  it  is 
connected  to  a  balanced  configuration  of  primary  and  secondary  memory. 

In  the  next  section,  we  look  at  previous  work  in  the  areas  of  asyn¬ 
chronous  systems,  parallel  computation  and  project  ''heduling  which  is 
relevant  to  the  research  presented  in  this  thesis. 


i 
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1.3  Related  Work 

Considerable  work  has  been  done  in  modelling  the  sequencing  or  control 
aspects  of  asynchronous  systems,  but  few  workers  have  actually  considered 
issues  of  timing  or  speed  of  operation. 

The  work  of  R.  Karp  and  R.  Miller  [Kl,  K2J  is  concerned  with  the 
development  of  a  mathematical  model  called  parallel  progr am  schemata 
to  represent  parallel  algorithms.  A  set  of  uninterpreted  operations  is 
defined  over  a  finite  set  of  memory  cells.  With  each  operation  is 
associated  two  subsets  of  the  memory  cells  called  the  domain  and  range 
cells.  Upon  activation,  callea  an  initiation,  an  operation  reads  the 
current  values  in  its  domain  cells  and  evaluates  a  function  with  these 
values  as  arguments.  An  unspecified  time  later,  the  results  are  deposited 
in  the  range  locations,  the  latter  action  being  called  a  termination. 
Control  is  conceptualized  as  a  possibly  infinite  directed  graph  consisting 
of  a  set  of  control  states  (nodes)  together  with  a  transition  function 
that  specifies  for  each  state  initiation  and  termination  pair  the  suc¬ 
ceeding  control  state.  A  computation  is  a  sequence  of  initiations  and 
terminations  that  corresponds  to  a  defined  path  in  the  control  graph 
(emanating  from  a  designated  starting  state)  and  that  satisfy  other  rules. 

this  formulation,  parallel  activation  of  operations  is  possible, 
but  their  work  on  the  whole  is  more  concerned  with  control  structures 
for  parallel  programs  that  properly  terminate  than  with  the  issues  in¬ 
volved  in  bping  able  to  represent  continuously  operating  asynchronous 
concurrent  systems. 

Dijkstra  [D4]  considers  a  method  by  which  asynchronous  sequential 
processes  may  operate  concurrently  and  communica  e  harmoniously.  The 
processes  are  provided  access  to  common  integer  values  called  semaphores . 
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The  semaphores  can  be  manipulated  by  means  of  two  synchronizing  primitives, 
the  "P"  and  "V"  operations  which  decrement  and  increment,  respectively, 
the  value  of  a  semaphore  by  one.  The  P  operation  can  be  executed  only 
when  the  current  value  of  a  semaphore  is  greater  than  zero.  Conditions 
are  investigated  for  an  ensemble  of  interconnected  sequential  processes 
to  operate  without  being  deadlocked. 

The  work  of  Holt  and  Commoner [H2, Cl]  is  very  significant  and  has  form¬ 
ed  the  starting  point  of  our  thesis.  Their  model,  called  Petri  nets, 
is  very  simple  to  understand,  and  consists  of  two  types  of  nodes  termed 
places  and  transitions.  A  set  of  directed  arcs  connects  places  to 
transitions  or  transitions  to  places.  Markers,  called  "tokens"  are  put 
on  places,  and  each  token  that  is  put  on  a  place  corresponds  to  the 
holding  of  one  instance  of  the  condition  corresponding  to  that  place. 

When  every  input  place  of  a  transition  has  a  token  on  it,  it  can  "fire." 

A  firing  causes  one  token  to  be  removed  from  each  input  place,  and  a 
token  to  be  added  to  each  output  place.  Commoner  has  investigated 
conditions  for  a  subclass  of  Petri  nets  to  operate  in  a  deadlock-free 
manner.  Hack  has  done  an  extensive  investigation  of  a  subclass  of 
Petri  nets  termed  Free  Choice  Petri  Nets  [HI]. 

So  far,  we  have  discussed  models  for  asynchronous  systems  in  which 
no  mention  was  made  of  real  time.  Two  bodies  of  work  exist  in  which 
real  time  issues  are  entered  into  for  asynchronous  concurrent  systems . 

The  first  is  the  PERT  network  used  in  project  scheduling  [FI].  A 
PERT  network  is  an  acyclic  directed  graph  with  an  input  vertex  and  an 
output  vertex.  All  arcs  in  the  system  lie  on  paths  from  the  input  vertex 
to  the  output  vertex.  Each  arc  denotes  an  activity  in  a  project,  and  a 
method  is  given  for  calculating  the  shortest  amount  of  time  that  it 


-19- 


takes  to  complete  the  project  (the  critical  path),  and  a^so  the  earliest 
and  latest  time  that  any  given  activity  can  initiate  in  order  that  the 
project  be  completed  in  the  shortest  time  possible.  While  PERT  networks 
can  explicitly  represent  concurrent  activity  in  a  system,  they  do  not 
have  the  power  to  represent  systems  which  operate  in  a  recurrent  or 
repetitive  fashion.  Nor,  for  that  matter,  can  they  explicitly  represent 
system  components  with  parallelism  within  each  component.  Most  important, 
there  is  no  provision  in  the  structure  of  PERT  networks  for  representing 
a  choice  between  alternative  actions  in  a  system.  This  makes  the  model 
inadequate  for  modelling  the  complex  asynchronous  systems  we  would  like 
to  haidle. 

Martin  and  Estrin  at  UCLA  have  studied  a  model  of  parallel  computation 
called  the  program  flowchart  [ Bl ,  Ml,  M2,  M3].  Program  flowcharts  are  x 
directed  graphs  consisting  of  nodes  and  arcs.  Nodes  represent  operations 
in  a  computer  program  and  arcs  represent  data  paths  between  them.  An 
operation  can  take  place  when  some  suitable  logical  combination  (and, 
exclusive-or)  of  its  incoming  arcs  have  data  values  on  them.  When  an 
operation  takes  place,  it  absorbs  data  values  from  its  incoming  arcs  and 
puts  data  values  on  some  logical  combination  of  outgoing  arcs.  The  model 
is  extremely  powerful  and  can  represent  decisions  explicitly.  Martin  and 
Estrin  are  concerned  with  modelling  parallel  computer  programs,  with  a 
view  to  finding  their  mean  computation  time,  i.e.,  the  amount  of  time  it 
takes  to  execute  a  computer  program  from  start  to  finish.  For  this  pur¬ 
pose,  they  assume  each  node  to  have  a  fixed  (deterministic)  time  duration. 
A  technique  is  given  for  finding  the  mean  execution  time  of  well-formed 
program  flowcharts.  Although  the  model  is  capable  of  representing 
recurrent  processes,  Martin  and  Estrin  have  confined  themselves,  in  their 
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analysis,  to  models  of  processes  which  operate  in  a  "one  shot"  fashion. 
Also,  they  have  not  developed  their  model  to  a  point  where  it  can  be 
used  for  modelling  complex  pipeline  processors  or  assembly  lines.  We 
will  show  in  the  course  of  our  work  the  added  generality  that  is  possible 
with  our  approach  to  modelling  asynchronous  systems. 


1.4  Overview  of  Following  Chapters 


1)  It  Is  shown  how  asynchronous  concurrent  systems  can  be  looked  upon 
as  an  ensemble  of  suitably  interconnected  finite  state  machines.  Petri 
nets  are  introduced  to  formalize  this  notion,  and  also  as  a  graphical 
tool  for  visualizing  the  structure  of  asynchronous  systems.  It  is  shown 
what  it  means  for  an  asynchronous  system  to  be  well-formed  in  term's  of 
the  corresponding  notions  of  liveness  and  boundedness  for  Petri  nets.  The 
two  classes  of  asynchronous  systems,  deterministic  and  non-deterministic , 
are  formalized.  Boundedness  is  shown  to  be  a  decidable  property  for 
Petri  nets,  and  liveness  in  turn  is  shown  decidable  for  bounded  nets. 

(Chapter  2) . 

2)  An  in-depth  discussion  is  given  of  the  structure  of  asynchronous  concurrent 
systems  in  terms  of  SMD  (State  Machine  Decomposable)  Petri  nets.  The 

notion  of  a  "current  assignment"  is  introduced  for  transitions  in  a  Petri  net, 
and  a  consistent  Petri  net  is  defined  as  one  which  can  support  a  consistent  current 
assignment.  Subclasses  of  Petri  nets  are  introduced,  chiefly  Event 
Graphs,  LSP  (Live,  Safe,  Persistent)  Petri  nets  and  SMA  (State  Machine 
Allocatable  )  Petri  nets.  Deterministic  and  non-deterministic  systems  are 
studied  in  terms  of  these  subclasses.  (Chapter  3). 

3)  Timed  Petri  nets  are  introduced.  These  are  Petri  nets  in  which  a 
transition  executes  for  a  fixed  non-zero  time  called  its  firing  time . 

The  maximum  rate  at  which  an  evei t  occurs  in  a  system  is  its  computation 
rate .  The  computation  rate  of  a  large  class  of  deterministic  systems 

is  found.  (Chapter  4). 

4)  The  general  problem  of  finding  the  computation  rate  of  non-deterministic 
systems  is  addressed.  A  bound  is  obtained  for  the  computation  rate  of 
non-deterministic  systems.  Systems  are  identified  for  which  this  bound 


is  achievable.  (Chapter  5). 

5)  Applications  of  our  work  to  the  modelling  and  analysis  of  real-world 
systems  are  given.  (Cb  iter  6). 

6)  Unsolved  problems  are  given,  together  with  recommendations  for  future 
research.  (Chapter  7). 


CHAPTER  2 


PETRI  NETS  AND  VECTOR  ADDITION  SYSTEMS 


2.1  Informal  Introduction  to  Petri  Nets 

We  will  present  the  idea  of  the  Petri  Net  model  as  an  extension  of 
the  finite  state  machine  model  of  conventional  automata  theory.  The 
latter  model  is  based  on  the  assumption  that  we  can  abstract  from  a  sys¬ 
tem  a  total  system  state ,  and  the  action  of  the  system  is  described  by 
the  set  of  all  allowable  states  and  a  set  of  rules  that  governs  the 
transition  of  the  system  from  one  state  to  another.  Let  us  give  a  simple 
example  here  to  explain  what  we  mean. 

Consider  a  stage  of  the  pipelined  floating  point  adder  shown  In 

Figure  2.1.1(a).  Assume  that  it  can  handle  one  pair  of  operands  at  a  time 
and  that  vrtien  it  is  performing  an  operation  on  an  operand  pair,  other 
operand  pairs  in  its  input  queues  must  vait  until  it  is  through  with  the 
current  operand  pair.  We  can  thus  represent  the  operation  of  the  stage 
of  thy  pipelined  adder  as  shown  in  Figure  2.1.1(a). 


Figure  2.1.1(a) 
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The  stage  can  be  in  one  of  three  states,  denoted  by  circles  in  Figure 
2.1.1(a)  : 

(a)  Ready  for  input  operand  pair. 

(b)  Unit  busy  (processing  an  operand  pair). 

(c)  Ready  to  output  result. 

Figure  2.1.1(a)  is  drawn  using  the  notation  of  finite-state  automata 
theory,  and  a  transition  from  one  state  to  another  is  denoted  by  an  arc. 
For  ■'•■usons  that  will  become  clear  soon,  we  will  insert  into  each  arc  in 
Figure  2.1.1(a)  a  bar  to  denote  the  transition  from  one  state  to  another. 
Also  we  would  like  to  indicate  what  state  a  system  is  in  at  any  given 
instant  of  time.  We  will  designate  this  by  the  presence  of  a  marker  on 
the  circle  corresponding  to  that  state  (see  Figure  2.1.1(b)). 


operand  pair 


Figure  2.1.1(b) 


The  system  shown  in  Figure  2.1.1(b)  is  in  the  state  :  "ready  for 
input  operand  pair".  When  the  system  absorbs  an  input  operand  pair,  it 

makes  a  transition  to  the  state  :  "unit  busy".  Finally,  when  it  is  done 
with  processing  the  operand  pair,  it  becomes  ready  to  output  the  result 

and  makes  a  transition  to  the  state  entitled  "ready  to  output  result". 
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The  system  then  outputs  the  result  and  becomes  ready  for  the  next  input 
operand  pair.  This  sequence  of  operations  repeats. 

Each  stage  of  the  system  can  be  modelled  in  this  fashion.  We  now  con¬ 


sider  how  to  model  a  two-stage  pipelined  adder,  given  that  each  stage  is 
modelled  in  this  fashion.  Once  again,  by  using  techniques  of  conventional 
automata  theory,  we  see  that  the  two-stage  system  can  be  represented  by 
a  finite  state  machine  which  is  the  cross-product  of  the  two  machines TH4] . 
Let  us  attempt  to  carry  out  this  construction.  We  will  assume  that  the 
states  of  the  two  machines  are  labelled  a,b,c  and  a^b'jc',  respectively 
(see  Figure  2.1.2). 


a 


Figure  2.1.3  shows  the  resulting  finite  state  machine.  We  see  that  it  has 
nine  states  and  a  total  of  eighteen  transitions  between  states.  The 
problems  with  this  representation  are  the  following: 

(a)  The  number  of  states  grows  as  the  product  of  the  number  of  states 
in  the  individual  stages. 

(b)  The  identity  of  the  individual  stages  has  been  lost.  For  all  we  know, 
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this  could  be  the  state  diagram  of  a  nine-state  system  consisting  of  only 
one  system  component  (or  stage). 

The  nine-state  finite  state  machine  obscures  the  structure  of  our  two- 
stage  pipelined  adder.  We  note  that  If  the  pipeline  has  n  stages,  the 
number  of  states  In  the  finite-state  machine  used  to  represent  it  would 
be  3°  .'  To  represent  a  five-stage  pipeline,  we  would  need  243  states. 

We  clearly  need  a  formalism  which  provides  more  economical  descriptions 
of  concurrent  systems. 

Let  us  briefly  discuss  parallelism  In  a  system  like  a  pipelined  adder. 
Suppose  we  consider  a  two-stage  pipelined  adder  in  which  the  first  stage 
has  five  functional  units  in  parallel,  and  can  thereby  support  up  to 
five  concurrent  computations.  The  second  stage  will  be  assumed  to  have 
eight  parallel  stages.  We  can  represent  parallelism  of  degree  five  in 
the  first  stage  by  placing  a  total  of  five  tokens  on  the  finite-state 
machine  used  to  represent  It.  Similarly,  we  place  eight  tokens  on  the 
finite  state  machine  used  to  represent  the  second  stage.  (See  Figure  2.1.4). 
Each  token  Is  assumed  to  move  from  one  state  to  another  independently 
of  all  other  tokens.  The  state  of  the  first  stage  is  now  a  vector 
(  n(a),  n(b),  n(c)  ),  where  each  element  of  the  vector  represents  the 
number  of  tokens  on  the  corresponding  place.  Note  that  : 
n(a)  +  n(b)  +  n(c)  =  5,  and 

n(a')  +  n(b')  +  n(c')  =  8. 

In  order  to  model  the  first  stage  as  a  finite  state  machine,  3^  states 

g 

are  needed.  Similarly,  3  states  ai  e  needed  for  the  second  stage,  giving 
13 

3  states  in  all  for  the  cross  product  machine  .'  Suppose  instead  of 
attempting  to  model  the  above  system  as  a  finite  state  machine,  we  use 


Figure  2.1.3  Finite  State  Machine  Model  of  Pipelined  Adder 


the  following  artifiee  :  coalesce  t3  and  tj  together,  giving  the  dia¬ 
gram  in  Figure  2.1.5.  The  coalesced  bar  is  relabelled  't'. 

The  output  terminal  of  stage  1  is  thus  made  the  input  terminal  of  stage 
2.  A  transfer  of  an  operand  can  take  place  when  stage  1  is  ready  to  out¬ 
put  a  result  and  stage  2  is  ready  for  an  input  operand.  In  terms  of  the 
net  in  Figure  2.1.5,  this  is  true  when  state  c  in  stage  1  and  state  a'  in 
stage  2  each  have  at  least  one  marker  on  them.  Figure  2.1.6  shows  a  confi¬ 
guration  of  the  pipeline  in  which  an  operand  pair  can  be  transferred  between 

2.1.7  shows  the  configuration  that  results  after  an 


the  stages.  Figure 


operand  pair  has  been  transferred. 

We  are  now  ready  to  Introduce  some  nomenclature.  Each  circle  In  the 

diagram  of  Figure  2.1.5  Is  called  a  place  ,  and  the  bars  are  called  trans 
ltlons.  The  act  of  transferring  an  operand  pair  between  stages  was 
achieved  by  an  action  called  the  firing  of  transition  t. 


Figure  2.1.4. 


a  Figure  2.1.5.  a' 

Figure  2.1.6  shows  a  configuration  of  the  system  in  which  transition  t  can 
be  fl:  red.  When  t  fires,  a  token  is  removed  from  each  input  place  and 

added  to  each  output  place  (see  Figure  2.1.7).  In  system  terms,  each  input 
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place  to  a  transition  represents  a  pre-condition  that  has  to  be  satisfied 
(or  hold)  in  order  that  the  action  corresponding  to  the  transition  may 
take  place.  If  there  are  several  tokens  on  a  place,  it  represents  several 
instances  of  the  holding  of  the  corresponding  condition.  Thus,  when  the 
action  corresponding  to  t  occurs,  it  causes  one  instance  of  each  pre¬ 
condition  to  cease  and  one  instance  of  each  output  condition  or  post¬ 
condition  to  begin  holding.  When  the  finite-state  machines  representing 
Stages  1  and  2  are  not  connected,  the  stages  they  model  are  said  to  be 
mutually  independent.  In  Figure  2.1.5,  the  finite  state  machines  are  said 
to  be  interconnected .  Many  other  terms  can  be  used  instead  of  interconnec¬ 
ted  stages,  for  example,  cooperating  or  mutually  synchronized  stages,  but 
we  will  continue  to  use  the  term  interconnected . 

What  we  have  done  in  the  last  few  paragraphs  is  to  introduce  a  way  of 
modelling,  in  an  economical  way,  systems  of  interacting  parts  in  which 
overlapped  operation  and  parallelism  are  both  present.  The  terminology  we 
have  used  is  part  of  the  definition  of  Petri  net3 ,  which  are  discussed  more 
formally  in  the  next  Section.  We  have  introduced  the  idea  of  Petri  nets 
as  a  natural  extension  to  the  finite  state  machine  model,  and  their  advan¬ 
tages  are  obvious  and  considerable. 

We  would  like  to  continue  our  informal  approach  in  order  to  introduce 
some  important  notions  that  we  will  need  in  understanding  the  structure 
of  asynchronous  systems.  Consider  the  finite-state  machine  in  Figure  2.1.1(b). 
The  state  machine  is  strongly-connected,  i.e.,  from  every  state  there 
exists  a  directed  path  to  every  other  state.  The  state  machine  in  Figure 
2.1.8  is  not  strongly-connect°.d  and  we  see  that  there  is  the  possibility 
that  the  token  can  appear  in  place  p^,  after  which  there  is  no  way  for  the 


token  to  appear  in  either  place  p^  or  place  p^»  What  this  implies  is  that 
once  the  token  appears  in  place  p^,  transitions  t^,t^  ant*  ^3  cannot  fire* 


Figure  2.1.8  A  finite  state  machine  that  is  not  strongly-connected. 


We  have  seen  that  transitions  can  be  used  to  represent  operators  in  an  actual 
system.  In  the  systems  we  are  interested  in,  we  insist  that  every  operator 
always  be  usable.  A  system  component  fchose  operation  can  be  represented  b;- 
the  finite  state  machine  of  Figure  2.1.8  has  certain  operators  (represented 
by  tl’t2  and  whlch  are  n0t  used  after  some  translent  behavior  of  the 
system.  In  the  steady  state,  transitions  t^  and  t,.  fire  alternately,  over 
and  over  again.  We  will  insist  that  each  finite  state  machine  be  strongly- 
connected,  and  the  reader  can  see  that  this  is  necessary  for  the  composite 
system  to  satisfy  the  requirement  that  every  operator  always  be  usable  in 
the  course  of  operation  of  the  system.  If  it  so  happens  that  some  of  the 
operators  can  never  be  used  (i.e.  their  corresponding  transitions  can 
never  be  fired)  ,  then  those  operators  can  be  removed  from  the  system 
without  affecting  its  operation.  Such  operators  are  termed  redundant. 
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The  reader  can  verify  that  in  the  resulting  net,  transitions  t  ,  t  and 

5’  6 

t'  are  redundant. 

6 

A  Petri  net  such  as  the  one  shown  in  Figure  2.1.9(b)  has  no  redundant 
transitions  and  will  be  termed  live.  On  the  other  hand,  in  the  net  shown 
i-n  Figure  2.1.9(c),  transitions  t^,  t^  and  t^  are  redundant,  whereas  all 
others  can  always  be  fired.  Such  a  net  is  termed  pseudo-live.  In  some 
Petri  nets,  the  operation  of  the  net  may  lead  to  a  configuration  in  which 
no  transition  can  be  fired.  Such  a  net  is  termed  non-live.  In  Figure 
2.1.10(b)  we  show  the  construction  of  such  a  net  from  the  two  state  machines 
of  Figure  2.1.10(a) . 


Figure  2.1.9(a) 


In  the  configuration  shown  In  Figure  2.1.10(b),  either  transition  t  or  t 


can  be  fired.  If  t ^  is  fired,  the  net  attains  a  configuration  in  which  no 
further  transitions  can  be  fired  (see  Figure  2.1.10(c)).  The  net  is  an 
example  of  a  non-live  net,  and  the  configuration  shown  in  Figure  2.1.10(c) 
is  a  deadlocked  configuration. 


What  we  have  done  so  far  is  to  motivate  a  formal  study  of  Petri  Nets 
which  we  now  proceed  to  do  in  the  next  section. 


Figure  2.1.10(a) 


Figure  2.1.10(b) 


Figure  2.1.10(c) 
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2.2  Formal  Definition  of  Petri  Nets  and  Relevant  Concepts 

2.2.1  Petri  Nets 

We  are  now  in  a  position  to  introduce  the  reader  to  the  formal  defini¬ 
tion  of  Petri  nets  together  with  the  related  terminology  that  we  will  use 
in  the  rest  of  this  thesis. 

Definition  2.2.1  A  Petri  Net  (P  is  a  three-tuple  <P,T,A> 

where  P  is  a  non-empty  set  of  distinctly-labelled  places  {pi,  P2>***Pn^* 

T  is  a  non-empty  set  of  distinctly-labelled  transitions 

{tp  t2»  ...tm}. 

A  is  a  relation;  it  corresponds  to  a  set  of  arc3,  where  each  arc 

is  either  from  a  place  to  a  transition  or  from  a  transition  to 

a  place  : 

A  C  PXT  U  TXP. 

Definition  2.2.2  A  marking  M  is  a  function  such  that  M:  P  —#•  T),  where  T1 
is  the  set  of  non-negative  integers.  The  non-negative  integer  associated 
with  a  place  represents  the  token  load  of  that  place,  or  the  number  of 
tokens  on  it. 

A  Petri  net  with  a  marking  will  be  referred  to  as  a  marked  Petri  net 
(see  figure  2.2.1  for  an  example). 

Notation  and  Terminology 

A  node  of  a  Petri  net  is  either  a  place  or  a  transition.  We  now  intro- 
a  ce  a  convenient  notation  for  the  predecessor  or  successor  nodes  of  any 
r  ide  in  a  Petri  net.  We  will  refer  to  it  as  the  dot  not  at  ion. 


Figure  2.2.1.  A  marked  Petri  Net 


(p,t)  e  A  is  written  p»t. 
£ y | x - y }  is  written  x* . 
{y|y*x}  is  written  *x. 


Example;  In  figure  2.2.1,  .t2  =  {pi,  p2l. 

PI*  =  Ctj,  t2). 


The  dot  notation  is  also  applied  to  designate  the  successor  or  pre¬ 


decessor  set  of  a  set  of  places  or  transitions.  Thus, 


*{t2,  t4]  =  fpi,  p2,  P3}. 
(P3»  P2i*=  { t2 »  fc3>  *4}- 


A  transition  t  in  a  Petri  Net  is  said  to  be  enabled  iff  every  input 


1 


1 


■Wi 
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place  €  «t  has  at  least  one  token  on  It.  An  enabled  transition  can  be 
fired.  When  a  transition  fires,  a  token  is  removed  from  each  input  place 
and  added  to  each  output  place  pj  €  t*.  Suppose  Ma  is  the  marking  that 
results  when  an  enabled  transition  ta  fires  at  marking  M.  We  write  this 
M  Ma. 

Now  suppose  transition  t^  can  be  fired  at  marking  Ma.  Let  Mb  be  the 

marking  that  results  when  transition  t^,  fires.  We  write 

M  Ma  M*5.  Continuing  in  this  fashion,  let 

us  suppose  that  at  every  new  marking  that  results  *rtien  a  transition  fires, 

at  least  one  transition  can  be  fired.  We  can  write  this 


M  ^-VMa  — ►Mh  — 


The  sequence  of  transition  firings  tatbtc 
sequence.  If  o=  tat|, . tn,  we  write 


M  tatb-..rt^Hn 


or  M 


-*-M  . 


tn  is  termed  a  firing 


Example. 

In  Figure  2.2.2(a)  transition  t2  is  enabled.  Figure  2 .2.2(b)  shows 
the  marking  that  results  when  transition  t2  is  fired  at  marking  M.  The 
reader  can  also  verify  that  t2t^t5t1t3  is  a  firing  sequence  for  the  ret. 

Definition  2.2.3:  A  marking  is  said  to  be  reachable  from  marking  M1 
if  there  exists  a  firing  sequence  a  such  that 


Marking  M 
(a) 


Marking  M' 
(b) 


Figure  2.2.2 


Definition  2.2.4;  The  forward  marking  class  SI  of  a  marking  M  is  the 
set  of  markings  that  is  reachable  from  M.  i.e., 

M  =  {M1  |  3  ct  €  T*  and  M  -Z*  M1}. 

Note:  T*  is  the  set  of  strings  on  the  alphabet 

T=  ffcl*  fc2 *  . t„). 

Notation:  We  have  defined  M  as  a  function  that  assigns  a  token  load  to 
every  place  pi  €  P.  M  can  also  be  looked  upon  as  a  vector,  the  ith  element 
of  which  corresponds  to  the  token  load  of  the  ith  place  Pi  in  the  Petri  net. 

Example :  The  marking  M  of  the  net  in  Figure  2.2.2(a)  can  be  written  as 

M  =  (1,  1,  0,  0) 


The  forward  marking  class  of  M  is  seen  to  be 

{(1, 1,0,0),  (0,0, 1,1),  (0,1, 1,0),  (1,0, 0,1)). 

Definition  2.2.5?  A  marking  M  is  1 ive  for  &  trans it  ion  t  iff  for  every 

marking  M  in  the  forward  marking  class  M  there  exists  a  firing  sequence 

which  fires  t. 

Definition  2.2.6;  A  marking  M  is  live  for  a  Petri  Net  'P  iff  it  is  live 
for  every  transition  in  the  net. 

Definition  2.2.7;  A  marking  M  is  bounded  f or  ji  place  j),  iff  there  exists 

an  integer  N  such  that  for  every  marking  M*€  M,  M*(p)  £  N.  If  N  =  1,  the 

marking  is  safe  for  place  jj. 

Definition  2.2.8*  A  marking  M  is  bounded  (or  safe)  for  the  Petri  net  /p 
iff  M  is  bounded  (or  safe)  for  every  place  in  the  net. 

In  Section  2.1,  we  pointed  out  that  there  exist  marked  Petri  nets  in 
which  a  marking  is  reached  in  which  no  transitions  can  be  fired,  i.e.,  the 
net  can  reach  a  deadlocked  configuration.  Suppose  this  deadlocked  con- 
figuration  is  called  M^,  and  suppose  that  the  net  is  in  some  configuration 
(i.e.,  has  a  marking)  M.  If  M  °  ^M1 ,  then  a  is  said  to  be  a  killing 
sequence  for  the  net  at  the  given  marking  M.  A  net  with  a  live  marking 
has  no  killing  sequence. 

There  is  one  more  important  issue  we  would  like  to  consider  in  this 
subsection.  In  Section  2.1,  we  pointed  out  that  the  only  Petri  nets  we 
will  use  for  modelling  asynchronous  concurrent  systems  will  be  nets  which 
have  a  live  marking.  We  would  also  like  the  marking  for  a  Petri  net  to 
be  bounded.  This  means  that  the  only  Petri  nets  we  would  like  to  consider 
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are  strongly-connected  nets.  Let  us  see  why.  Consider  a  non-strongly 
connected  Petri-net  f .  Then,  there  must  exist  in  the  net  two  portions  of 
the  net  A  and  A*  such  that  all  arcs  between  them  are  directed  from  A  to 
A1,  as  shown  in  Figure  2.2.3.  Each  of  the  nodes  a  and  c  in  A  can  either 
be  a  place  or  a  transition.  Suppose  a  is  a  place.  Then  b  must  be  a 
transition.  Since  the  marking  for  the  net  is  live,  it  means  that  b  can 
be  fired  repeatedly.  But  each  firing  of  transition  b  removes  a  token  from 
place  a.  Now  suppose  we  do  not  fire  transition  b  at  all.  This  would  mean 
that  all  the  tokens  which  were  previously  being  removed  by  firings  of  b 
can  now  stay  on  place  a,  which  means  that  the  number  of  tokens  on  a  can 
become  unbounded. 


Now  suppose  that  a  is  a  transition  and  b  is  a  place.  Since  the  marking 
for  (P  is  live ,  we  can  fire  transition  a  repeatedly,  causing  the  token 
content  of  b  to  become  unbounded.  Thus,  if  Q  is  not  strongly-connected, 


the  token  content  of  some  place  in  r  becomes  unbounded.  Now,  since  we 
are  concerned  only  with  nets  which  have  a  live,  bounded  marking,  all 


In  Section  1.2,  we  said  that  there  are  two  broad  classes  of  systems 
that  we  would  like  to  distinguish  between.  We  now  pursue  that  line  of 
thinking  formally. 

2.2.2  Deterministic  and  Non-Deterministic  Petri  Nets 

We  would  like  to  make  an  important  distinction  between  two  classes  of 
Petri  nets,  which  we  term  deterministic  and  non-determinlstlc .  First,  we 
look  at  a  structural  condition  of  Petri  nets  known  as  conflict. 

Definition:  Consider  a  Petri  net  P  -  ^P,T,A^.  Two  or  more  transitions 

t]_,t2,  . t^  are  said  to  be  in  conflict  if  there  exists  a  place  p  such 

that  p*t^A  p.t2 . AP*tk- 

Note:  'A'  denotes  the  logical  "and"  operator. 


Figure  2.2.4 
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In  Figure  2.2.4  transitions  t^  and  t ^  are  In  conflict  jit  place  p. 

With  the  marking  shown,  a  token  In  place  p^  can  be  removed  by  the  firing 
of  either  transition  t^  or  transition  t^.  Thus,  for  the  given  marking, 
we  have  a  choice  between  firing  either  transition  t^  or  transition  t^, 
and  when  either  transition  fires,  the  other  ceases  to  be  enabled.  Before 
we  can  go  any  further,  we  must  make  a  few  definitions. 

Definition  2.2.9;  A  marking  M  Is  persistent  for  a  transition  t  In  a 
Petri  net  'P  If  t  has  the  property  that  once  it  is  enabled  at  any  marking 
€  M,  It  cannot  cease  to  be  enabled  by  the  firing  of  any  transition 
other  than  Itself. 

Definition  2.2.10:  A  marking  M  Is  persistent  for  a  Petri  net  P  iff  it 
is  persistent  for  every  transition  t  6  P. 

The  net  In  figure  2.2.4  has  the  property  that  It  has  no  marking  which  Is 
persistent  for  transitions  t^  and  t^.  Such  a  net  Is  termed  a  non-deter- 
mlnlstic  net,  and  represents  a  system  In  which  there  is  a  distinct  choice 
between  alternative  actions  (see  section  1.2).  In  contrast,  the  net  In 
figure  2.2.6  Is  a  deterministic  net  or  a  net  with  a  persistent  marking. 

We  now  Introduce  some  convenient  ways  of  referring  to  the  nets  we  have 
beer,  discussing. 

A  net  with  a  Live,  Bounded  or  a  Live,  Safe  marking  will  be  termed  an 
LB  or  LS  net,  respectively.  If  a  Live,  Bounded  marking  is  also  Persistent 
we  will  call  the  Petri  net  an  LBP  net.  A  net  with  a  Live,  Safe,  Persistent 
marking  will  be  called  an  LSP  net.  The  net  in  Figure  2.2.5  is  an  LSP  net. 
Petri  ne;s  are  a  graphical  representation  of  a  mathematical  system 


Figure  2.2.5  A  Petri  net  with  a  live,  safe,  persistent  marking 


known  as  vector  addition  systems .  In  the  next  section,  we  use  some  results 


in  vector  addition  systems  to  prove  the  decidability  of  certain  issues  in 
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2.3  Vector  Addition  Systems  and  their  Relationship  to  Petri  Nets 

In  this  section,  we  give  a  brief  description  of  vector  addition  systems, 
but  a  more  thorough  coverage  can  be  found  in  Karp  and  Miller  [K2]. 

Definition  2,3.1:  An  r-dimanslonal  vector  addition  system  V  is  a  pair 
V  =  (s,W)  where 

(1)  s  €  Nr,  N  =  {0,1,  ...}. 

(2)  W  is  a  finite  set  of  r-dimensional  integer  vectors 
W=  {wj,...^^}  ,  Wj^  €  {0.+1.+2,  ...]r 

The  reachability  set  R(V)  is  the  set  of  vectors  given  by 

R(V)  =  {xjjx^  =  s  +  w^j  +  w^  +  ...  +  w^}  where  w^  €  W. 

a.  w^j  €  W,  1,2, ...lc. 

b.  s  +  wu  +  w12  +  ...  +  wtk  »  0,  k  -  l,2,...t. 

The  reachability  set  is  the  set  of  all  points  that  can  be  reached  by  some 
path  from  s  using  vectors  from  W  and  never  leaving  the  first  orthant. 

Example  2.3.1:  As  an  example  of  a  four  element  vector  addition  system, 
consider 

V  *  (s,w) 

where  s  =  (1,1, 0,0) 

W  =  (wj^  «  (-1,0, +1,0),  w2  =  (-1,-1,+1,+1)  ,  w3  -  (0,-l,0,+l), 
w^  =  (+1,0, -1,0),  w5  =  (0,+l,0,-l)}. 

The  reachability  set  R(V)  of  this  vector  addition  system  consists  of 
four  vectors  {(1,1, 0,0),  (0,0, 1,1),  (0,1, 1,0),  (1,0, 0,1)}. 

Not at  ion 

Let  us  denote  the  vectors  in  the  reachability  set  by  y^, 
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th 


i.e.,  R(v)  =  {yi,  y2,  ...  ym}. 

Also,  if  y^  Is  a  vector,  then  (y^)^  will  denote  the  k*""  element  of  y^. 

We  will  draw  a  directed  graph  with  the  elements  of  R(V)  as  vertices, 
qnd  an  arc  from  vertex  y^  to  y^  if  there  exists  a  vector  w^  €  W  such  that 


j 


y<  +  wi 


Each  arc  will  be  labelled  with  Its  corresponding  vector  w^  above,  and  the 
resulting  diagram  shows  at  a  glance  the  vectors  in  the  reachability  set 
and  how  they  can  be  reached  from  one  another.  We' will  call  this  diagram 
the  reachability  diagram  of  the  vector  addition  system  V. 

As  an  example,  we  give  the  reachability  diagram  corresponding  to  the 
vector  addition  system  of  example  2.3.1. 


(0,1, 1,0) 


5 


(1,0,0, 1) 
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If  the  reachability  set  of  a  vector  addition  system  is  finite,  the  reacha¬ 
bility  diagram  is  a  finite-state  machine,  and  it  is  practical  to  draw  it  as 
we  did  for  our  example.  If,  on  the  other  hand,  the  reachability  set  is 
infinite,  we  must  find  an  alternative  representational  tool.  For  this 
purpose,  we  introduce  the  following  terminology: 

(1)  We  define  a  quantity  to  such  that  if  n  is  an  integer  then  n  <  to,  and 
n  +  to  =  td.  The  quantity  to  essentially  represents  infinity  and  we  discuss 
its  use  shortly. 

(2)  A  rooted  tree  is  a  directed  graph  such  that  one  node  (the  toot  node) 
has  no  arcs  directed  into  it,  each  other  node  has  exactly  one  arc  directed 
into  it  and  each  node  lies  on  a  directed  path  from  the  root  node.  If  p  and 
Y  are  distinct  nodes  of  a  rooted  tree  and  there  is  a  directed  path  from  p  to 
Y,  we  write  P  <  y*  If  there  is  an  arc  directed  from  p  to  y»  then  y  is  a 
successor  of  p.  A  node  with  no  successors  is  called  a  leaf.  With  an  r- 
dimensional  vector  addition  system  V  =  (s,W),  we  shall  associate  a  rooted 
tree  T(V).  Each  node  p  €  T(V)  is  labelled  with  an  r-dimensional  vector  £(P)» 
where  each  element  of  £(p)  belongs  to  the  set  N  fw}.  A  recursive  procedure  is 
given  below  for  the  construction  of  T(V).  Note  that  the  label  for  a  node 

is  assigned  when  the  node  is  added  to  the  tree. 

(1)  Create  a  root  node  and  label  it  s. 

(2)  Let  p  be  a  node  in  the  partially  created  tree  with  no  successors. 

It,  for  some  node  Y»  Y  <  P  and  £(y)  =  £ (p) ,  then  p  is  a  leaf  in  T(V) .  Other¬ 
wise,  for  each  w  €  W  such  that  w  +  £(p)  0,  add  a  node  P  and  make  it  the 

w 

successor  of  p.  For  each  p  the  ith  coordinate  of  4(p  )  is  assigned  as 

w  w 


follows : 
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(i)  If  there  exists  y  <  P  such  that  l(y)  <  jfc(p)  +  w  and 

w 

4(y)<  <  (£(P)  +  w).  then  £(p  )  =  u>. 

i  i  w  i 

(ii)  If  no  such  y  exists,  then  f(p  )  =  (£(p)  +  w).. 

w  i  i 

(3)  Repeat  step  (2)  until  no  new  nodes  can  be  added  to  the  tree. 
We  show  in  Appendix  I  that  for  any  vector  addition  system  V,  the  tree 
T(V)  is  finite. 


Example  2.3.2.  As  an  example  (taken  from  Karp  and  Miller),  consider  the 
following  vector  addition  system  V  =  (s,W). 
s  =  (1,0, 0,0,0) 


W=  {(-1,1, 0,0,0),  (-1,0,0, 1,0),  (0,-1, 2, 0,0),  (0,1, -1,0,0),  (0,0,0, -1,2) 

(0,0,0, 1,-1)}. 

The  rooted  tree  T(V)  is  :  iO.O.u.O.O^O.oi.u.O.O) 

I(0,l,0,0>0)-K0,0,2,0,0)-M0,l,oi,0,0)^  \o, (o,o), 0,0) 


(1,0, 0,0,0) 


/ 

\o, 0,0,1, 


0)-*-(0, 0,0,0, 2:-*«(0, 0,0, 1,(0 


Figure  2.3.2 


(0,o>,u),0,0) 

(0 ,0 , 0 , 0  ,o))— HO ,  0 , 0  ,o)  ,o)) 

(0 , 0, 0  ,o)  ,to)-*»(0 , 0 , 0 ,  o)  ,oi) 
>'^(0,0,0,o),o)) 


All  our  decidable  results  about  vector  addition  systems  reduce  to  inspecting 
the  tree  T(V)  and  using  the  following  theorem. 


Theorem  2.3.1:  For  any  vector  addition  system  V  and  any  integer  vector  x 
of  the  same  dimension 

(ly  €  R(V)  such  that  x  <  y)  «  (1  p  S  T(V)  such  that  x  <  £(p)). 


Proof ;  Given  in  Appendix  I. 


Corollaries 

The  following  questions  are  decidable: 

Corollary  2. 3. 1.1 

V  y  €  R(v)  is  (y)^  £  n  for  some  finite  n? 

Corollary  2.3. 1.2 

V  y  €  R(V) ,  is  a  given  set  of  elements  of  the  vector  y  simultaneously 
bounded? 

Corollary  2.3. 1.3 

Is  R(V)  a  finite  set? 

In  addition,  if  the  questions  in  Corollaries  2. 3. 1.1  or  2. 3. 1.2  are  true, 
the  bounds  can  be  effectively  computed.  For  the  vector  addition  system 
in  Example  2.3.2,  we  see  that  V  y  €  R(V) ,  there  are  elements  which  be¬ 
come  unbounded,  and  hence  R(V)  is  infinite.  Furthermore,  the  elements 
that  can  become  simultaneously  unbounded  are  (3),  (5),  (2,3)  or  (4,5). 

Petri  Nets  and  Vector  Addition  Systems 

It  is  easy  to  see  that  for  every  marked  Petri  net  there  is  a  correspon¬ 
ding  vector  addition  system.  Let  us  explore  this  issue  in  detail. 

Let  be  a  Petri  net  with  initial  marking  M.  We  can  construct  an 
equivalent  vector  addition  system  V  =  (s,W)  as  follows: 

The  dimension  of  the  vector  addition  system  is  made  equal  to  the 
number  of  places  in  the  Petri  net,  and  the  starting  vector  s  is  taken  to 


be  the  initial  marking  of  the  net.  It  now  remains  for  us  to  show  how 


to  construct  the  set  of  vectors  W.  Consider  a  marking  M'  in  the  marking 

class  M,  at  which  a  transition  is  enabled.  Since  the  firing  of  th 

decrements  the  token  content  of  each  of  its  input  places  by  one,  and 

increments  the  token  load  of  each  of  its  output  places  by  one,  we  can 

represent  a  firing  of  t^  by  adding  a  vector  to  the  marking  M'  to  yield 

the  marking  that  results  when  transition  t  is  fired.  The  elements  of  the 

h 

vector  are  given  the  following  values: 

+1  if  the  corresponding  place  is  an  output  place  of  t,  . 

n 

-1  if  the  corresponding  place  is  an  input  place  of  t.. 

h 

0  otherwise  (we  are  assuming  that  there  are  no  self-loops  in  (P  ; 
see  figure  2.3.3). 

ccj 

\ 

Figure  2.3.3.  A  Self-loop 

We  now  assert  that  the  vector  addition  system  V  is  equivalent  to  the 
Petri  net  with  marking  M  in  the  following  sense: 

(a)  For  every  marking  €  M,  there  exists  a  vector  y  €  R(V)  such 
that  y  =  M  . 

q 

(b)  A  vector  addition  sequence  v  is  a  sequence  of  vector  additions 

:  I 

L 
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wj  +  . . .  +  wg  such  that  it  transforms  a  vector  to  vector  yg.  i.e., 

y.  — — y  where 

i  J  s 

v  =  w.  . .  .w  . 
j  s 

We  see  that  corresponding  to  every  vector  addition  sequence  v  there 
exists  a  firing  sequence  a  =  t  ...t  in  the  Petri  net  and  vice  versa. 

J  S 

We  now  establish  some  decidable  results  for  Petri  nets  based  on  the 
decidability  of  the  corresponding  issues  for  Vector  Addition  Systems. 

Theorem  2.3.2  It  is  decidable  if  a  given  marking  M  for  a  Petri  net 
is  bounded. 

Proof :  Let  ?  be  a  Petri  net  and  M  its  initial  marking.  By  the  construc¬ 
tion  above,  we  can  find  an  equivalent  vector  addition  system  V  =  (s,W). 
From  Corollary  2. 3. 1.2,  we  know  that  the  following  is  decidable: 

Given  any  vector  y  €  R(V),  is  a  given  set  of  elements  of  y  simul¬ 
taneously  bounded? 

Thus,  it  is  decidable  if  M  is  bounded  for  the  Petri  net  $  . 

■ 

We  can  now  establish  the  following  result: 

Theorem  2.3.3  It  is  decidable  if  a  given  marking  for  a  Petri  net  is 
live  and  bounded  (i.e.,  if  a  given  Petri  net  is  LB). 

Proof:  Since  we  can  decide  if  the  given  mark-*  j  bounded,  we  proceed 

to  show  that  there  exists  a  decision  pro,  decide  if  a  given 

bounded  marking  is  live.  For  a  bounded  marking,  the  reachability  set 
of  the  corresponding  vector  addition  system  V  is  finite.  Hence,  the 
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reachabllity  diagram  for  V  is  finite.  The  marking  M  is  live  iff  for 

any  marking  M  €  M  there  exists  a  firing  sequence  that  fires  every 
transition  in  the  net.  In  terms  of  the  reachability  diagram,  this 

means  that  starting  at  any  vertex  y^  and  given  a  vector  €  W,  there 

must  exist  a  vector  addition  sequence  that  contains  w.  . 

K 


Figure  2. 3. A 


Consider  figure  2. 3. A.  Find  the  set  of  vertices  {y  ,  y  ,  ...y  }  in 

1  2  m 

the  reachability  diagram  which  have  as  an  outgoing  arc.  We  have  to 
find  if  there  exists  a  directed  path  from  y_^  to  at  least  one  vertex  in 
this  3et •  Since  the  graph  is  finite,  this  can  be  done  by  exhaustion. 
The  procedure  is  repeated  for  every  vector  :  M,  starting  at  the 
vertex  y^  We  now  choose  all  the  vertices  in  turn  and  carry  out  this 
test.  The  total  number  of  tests  involved  is  |w|  X  |r(V)1  where 
|w|  =  the  -lumber  of  vectors  in  the  set  W. 

|R(V)|  =  number  of  vectors  in  the  reachability  set  of  V. 
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Since  both  quantities  are  finite,  the  algorithm  must  terminate.  This 
proves  the  required  result.  ■ 

We  now  prove  another  theorem  for  a  Petri  net  with  a  bounded  marking. 

Theorem  2.3.4  It  is  decidable  if  a  bounded  marking  M  for  a  Petri  net 
is  persistent. 


Proof:  To  check  if  a  bounded  marking  is  persistent,  perform  the  following 
test  for  every  marking  in  the  marking  class: 


Let  M'  be  a  marking  in  the  marking  class  M,  and  let  {t.' ,  t',  . . . t  * }  be 

12  n 

the  set  of  enabled  transitions.  Let  the  firing  of  a  transition 
Ck  ^  ^ea<*  to  t*ie  roaming  M^,  i.e.,  M' 


K- 


For  the  firing  of  each  transition  t^,  we  check  to  see  if  all  other 
enabled  transitions  t|  i  t£  are  still  enabled.  If  not,  the  marking  M' 
is  not  persistent.  If  the  marking  M‘  is  persistent  for  all  of  its 
enabled  transitions  then  we  check  the  next  marking  in  the  marking  class, 
and  so  on  until  all  markings  M1  €  M  have  been  exhausted.  The  marking 
M  is  persistent  iff  all  markings  M'  €  M  are  persistent  for  their  enabled 
transitions.  Since  the  total  number  of  tests  to  be  performed  is  finite 
and  bounded,  the  algorithm  must  terminate.  ■ 

This  concludes  our  discussion  of  decidable  issues  in  Petri  nets. 

We  have  so  far  looked  at  Petri  nets  in  terms  of  their  markings  and  their 
marking  class.  In  the  next  chapter,  we  look  at  Petri  nets  in  terms  of 


their  structure. 


CHAPTER  3 


RELEVANT  RESULTS  FROM  PETRI  NET  THEORY 


3.1  State  Machine  Decomposable  Petri  nets 

In  Section  2.i,  we  saw  how  an  asynchronous  concurrent  system  can  be 
viewed  as  an  ensemble  of  interacting  components,  where  each  component  is 
structurally  similar  to  the  finite-state  machine  of  automata  thory.  Each 
component  has  a  finite  number  of  allowable  states;  since  we  are  concerned 
with  systems  which  have  no  redundant  operators  (see  Section  2.1),  the 
Petri  nets  of  interest  are  LB.  We  will  use  a  restricted  class  of  Petri 
nets  known  as  LB  SMD  (State  Machine  Decomposable)  Petri  nets  to  model 
asynchronous  concurrent  systems.  In  this  Section,  we  formally  define  the 
notions  of  state  machine  and  SMD  Petri  nets. 

Definition  3.1.1  A  closed  subnet  II '  of  a  Petri  net  P  is  a  strongly- 
connected  Petri  net  (P',  T',  A')  where 
P'  CP  is  a  set  of  places , 

T1  C  T  is  a  set  of  transitions , 

A1  C  a  is  a  set  of  arcs ,  such  that 

•P*  =  P'*  =  T',  and 

A'  =  r (P *  x  T1)  U  (T1  X  P')}  n  A. 

The  Petri  net  N  in  Figure  3.1.1  has  five  closed  subnets  N^,  N^,  N^,  N^ 
and  N,..  Clearly,  every  strongly-connected  Petri  net  is  a  closed  subnet 
of  itself,  because  the  relation  *P  =  P*  =  T  is  trivially  satisfied. 
We  are  interested  in  identifying  components  or  parts  of  a  system  that 
can  be  represented  by  a  Petri  net,  and  for  this  reason  we  would  like  to 
define  a  minimal  structure  which  is  part  of  a  Petri  net  -md  is  still  a 
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Figure  3.1.2 
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closed  subnet. 

Definition  3.1.2  A  closed  subnet  is  a  minimal  closed  subnet  if  and 
only  if  no  closed  subnet  can  be  obtained  by  deleting  any  portion  of  it. 
Example  3.1.1  N  is  not  a  minimal  closed  subnet  because  by  deleting 
appropriate  places  and/or  transitions,  the  four  closed  subnets  N^,  N^, 
and  are  obtained.  However,  the  closed  subnets  N^,  N^,  and 
are  minimal  closed  subnets,  as  the  reader  can  easily  verify. 

Example  3.1.2  Consider  the  Petri  net  model  of  a  pipeline  processor 
shown  in  Figure  3.1.2(a).  The  closed  subnets  representing  each  indivi¬ 
dual  stage  are  minimal  closed  subnets  (see  Figure  3.1.2(b)  ).0n  the  other 
hand,  the  closed  subnet  in  Figure  3.1.2(c)  is  not  minimal.  All  the  Petri 
nets  considered  so  far  have  the  property  that  their  minimal  closed  sub¬ 
nets  have  disjoint  places.  Lest  the  reader  be  under  the  impression  that 
this  is  a  requirement  on  minimal  closed  subnets,  we  would  like  to  empha¬ 
size  that  this  is  not  so.  Consider  the  net  in  Figure  3.1.3(a).  It  has 
the  minimal  closed  subnets  S^,  and  shown  in  Figure  3.1.3(b).  We 
will  now  formally  define  the  notion  of  state-machine  based  on  a  suitable 
structural  restriction  on  Petri  nets. 

Definition  3.1.3  A  Petri  net  P  is  a  state-machine  if  and  only  if  every 
transition  has  exactly  one  input  place  and  exactly  one  output  place. 

Example  3.1.3  For  an  example  of  a  state-machine,  see  Figure  3.1.4. 
Definition  3.1.4  A  Petri  net  is  said  to  be  covered  by  a  set  of  closed 
subnets  {  11^,  if  and  only  if 

^  =  <  u  p  ,  u  t  u  a,  > . 

i  1  i  i  i  1 

Example  3.1.4  The  Petri  net  in  Figure  3.1.2(a)  is  covered  by  the  set 


Figure  3.1.4  A  State  Machine 

of  closed  subnets  {S^,  ^  S^} .  Similarly,  the  net  in  Figure  3.1.3(a) 
is  covered  by  the  subnets  of  Figure  3.1.3(b). 

Definition  3.1.3  A  Petri  net  is  SMD  (State  Machine  Decomposable)  iff 
every  minimal  closed  subnet  is  a  state  machine  and  there  exists  a  set  of 
state  machines  {S^,...S^}  which  covers  the  net. 

Example  3.1.5  The  Petri  nets  in  Figures  3.1.2(a)  and  3.1.3(b)  are  SMD. 

On  the  othbr  hand,  the  net  in  Figure  3.1.1(a)  is  not  SMD,  because  the 
minimal  closed  subnets  N^,  and  are  not  state  machines. 

Definition  3.1.6  The  token  content  of  a  Petri  net  (P,  T,  a)  with  a 
marking  M  is  the  number  of  tokens  on  all  places  in  the  net,  and  is  given 

by  N(P)  =  EM(p) 

Pi6  P  1 

Lemma  3.1.1  The  token  content  of  a  marked  state  machine  is  invariant 
under  transition  firings. 


Proof  Suppose  we  consider  a  state  machine  *?  =  <P,  T,  A>  with  mar- 


king  M.  The  only  way  in  which  the  token  content  N(p)  of  the  state  machine 
can  change  is  by  the  firing  of  a  transition.  However,  the  firing  of  a 
transition  removes  exactly  one  token  from  the  net  and  adds  exactly  one 
token  to  the  net.  Thus,  the  firing  of  a  transition  does  not  change  the 
number  of  tokens  on  the  net. 


Theorem  3.1.1  Every  marking  for  an  SMD  Petri  net  is  bounded. 

Proof  An  SMD  Petri  net  has  a  finite  number  of  state  machine  components. 

Let  these  compon  ents  be  S^,...^*  Also,  by  Lemma  3.1.1,  the  token  content 

of  each  state  machine  is  invariant.  Let  the  token  content  of  the  nth  state 

machine  be  N(P  ).  Now,  recall  the  definition  of  a  bounded  marking  for  a 
n 

Petri  net.  We  have  to  show  that  for  each  place  p  in  the  net,  there  exists 

an  integer  Z(p)  such  that  M(p)  £  Z(p)  for  each  M  in  M. 

Let  S  ,,...S  be  the  set  of  state  machines  which  contain  place  p. 
pi  pm 

Then,  if  N(spl) , • • .N(Spm)  are  their  respective  token  contents,  we  see 

that  Z(p)  ^  mint  N(S  .),... N(S  )].  If  ?<p)  were  greater  than 

pi  pm 

min  [  N(S  ,),... N(S  )],  it  would  imply  that  there  exists  a  state  machi- 
p  1  pm' 

ne  such  that  Z(p)  >  N(S^),  which  is  impossible.  Hence,  for  each 

place  p  in  the  net,  we  can  find  an  integer  Z(p)  which  bounds  its  token 
content.  This  proves  the  Theorem. 


Corollary  3. 1.1.1  It  is  decidable  if  any  given  marking  M  for  an  SMD 
Petri  net  'P  is  live. 

Proof  By  Theorem  2.3.3,  it  is  decidable  if  a  bounded  marking  M  for 
Petri  net  P is  live.  Also,  since  any  given  marking  for  'P  is  bounded, 
the  desired  result  follows  immediately. 
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This  result  is  useful  because  it  tells  us  that  we  can  decide  if  a  given 
marking  M  for  a  Petri  net  is  live,  but  it  does  not  give  us  an  effec¬ 

tive  procedure  for  constructing  a  live  marking  for  P .  We  wj.11  look  at 
this  issue  in  Sections  3.3  and  3.5.  Before  we  proceed  to  the  next  Sec¬ 
tion,  we  give  the  reader  an  example  which  illustrates  the  descriptive 
power  of  SMD  Petri  nets.  The  example  of  the  pipeline  processor  we  gave 
in  Figure  3.1.2(a)  was  that  of  a  deterministic  system.  In  Figure  3.1.5, 
we  give  an  SMD  Petri  net  model  of  the  pipeline  processor  for  the  compu¬ 
ter  system  with  three  instruction  types  which  we  discussed  in  Section 
1.2  (see  Figure  1.2.3).  Figure  3.1.6  shows  the  net  in  Figure  3.1.5 
split  up  into  its  state  machine  components,  which  are  labelled  S^  throu¬ 
gh  Sg.  The  reader  will  note  that  the  decision  about  processing  an  instr¬ 
uction  after  it  has  been  decoded  can  be  made  in  the  state  machine  S^. 
Place  p  in  state  machine  S^  has  three  output  arcs,  one  for  each  instruc¬ 
tion  type.  We  said  in  Section  1.2  that  in  order  to  estimate  the  pro¬ 
cessing  rate  of  such  a  pipelined  processor,  we  must  know  the  relative 
frequency  of  occurrence  of  each  of  the  instruction  types  A,  B  and  C.  In 
other  words,  we  must  know  the  relative  frequency  of  occurrence  (or  the 
probability)  of  each  of  the  output  transitions  of  place  p.  Thus,  on  the 
average,  the  rates  at  which  each  instruction  type  occurs  will  be  reflec¬ 
ted  in  the  number  of  times  each  of  the  transitions  t  ,  t,  and  t  fires 

a  b  c 

in  the  long  run.  We  continue  this  train  of  thought  in  the  next  Section, 
where  we  present  the  important  notion  of  consistency.  In  the  meantime, 
the  reader  should  explore  the  descriptive  power  of  SMD  Petri  nets  by 
constructing  txamples  of  his  own. 


3.2  Consistent  Petri  lets 

In  this  Section,  we  will  look  at  the  number  of  times  any  transition 
in  an  LB  Petri  net  can  fire  relative  to  the  other  transitions  in  the 
net.  We  will  first  need  some  definitions. 

Definition  3.2.1  A  current  assignment  for  a  Petri  net  P  =  (P,'T,A)  is 

a  function  $  that  assigns  to  each  transition  t^  6  T  a  positive  rational 
number  cp^  called  its  current.  A  current  assignment  for  a  Petri  net 
must  satisfy  the  following  two  constraints: 

(1)  Every  arc  carries  a  current  equal  to  that  associated  with  the  transi¬ 
tion  it  is  connected  to. 

(2)  At  every  place,  the  sum  of  the  currents  on  the  input  arcs  must  equal 
the  sum  of  the  currents  on  the  output  arcs  (i.e.,  Kirchoff's  current  law) 

Definition  3.2.2  A  Petri  net  is  consistent  iff  it  has  a  current  assign 
ment  5  with  cpt  >  0  for  each  t^  €  T. 

Checking  the  Consistency  of  a  Petri  net 


Figure  3.2.1 


Consider  the  Petri  net  shown  in  Figure  3.2.1.  Let  each  transition  t  in 
the  net  be  assigned  a  current  cp^  For  each  place,  we  can  write  an  equa 
tion  that  specifies  a  constraint  on  the  input  and  output  currents. 


pl: 

-  9i  -  cp2 

ii 

9^ 

+ 

0 

(1) 

P2: 

H 

9- 

+ 

-93 

= 

0 

(2) 

V 

+  92 

-*3 

= 

0 

(3) 

V 

+  93 

-94  = 

0 

(4) 

From  Equations  (2),  (3)  and 

(4)  we  get, 

CPi  « 

V2  -  cp3 

ii 

ii 

-6 

(sa>)  . 

This  violates 

Equation  (1) , 

which  requires 

that 

ii 

ii 

CM 

a 

+ 

H 

9- 

2cp. 

We  conclude  that  the  net  in  Figure  3.2.1  is 

not  consistent,  or 

is 

inconsistent.  A  slight  modification  of  this  net  leads  to  the  consistent 
net  of  Figure  3.2.2. 


Figure  3.2.2 
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i 


The  reader  will  note  that  all  we  have  done  is  to  feed  another  current  of 
magnitude  $  into  the  place  p  .  The  equations  become 


Pi  :  -  *!  -  *2 


+  *4  +  *5 


=  0 


+  ♦, 


=  0 


+  <J>  ~ 

2  3 


+  $ 

3 


+ 


=  0 

-  0 


0 


These  equations  are,  indeed,  consistent  and  we  get  the  following  consis¬ 
tent  current  assignment: 

♦  x  -  <j>2  -  d>3  =  <fr4  -  *5  -  $  ,  where  4.  is  any 

non- zero  real  number. 

Consider  a  consistent  current  assignment  for  a  Petri  net  P  .  Multi¬ 
ply  all  currents  by  the  least  common  multiple  of  their  denominators,  and 
divide  each  resulting  current  by  their  greatest  common  divisor.  The 
resulting  currents  are  said  to  form  a  minimal  integer  consistent  current 
assignment  for  P  .  The  reader  will  note  that  consistency  is  a  purely 
structural  property  of  a  Petri  net.  The  following  Theorem  and  discussion 


explain  the  relationship  between  the  notion  of  consistency  and  the 
structure  of  LB  Petri  nets.  The  ideas  in  the  following  material  have  been 
partly  inspired  by  Baker  [BA]. 


Definition  3.2.3  Let  P  be  a  Petri  net  with  marking  M.  A  cyclic  firing 
sequence  is  a  firing  sequence  a  which  fires  every  transition  of  ^at 
least  once  and  brings  the  marking  of  the  net  back  to  M. 
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Definition  3.2.4  The  firing  count  of  a  transition  t^  in  a  firing  se¬ 
quence  a  is  the  number  of  occurrences  of  t^  in  a  .  The  tiring  vector  'i' 
of  a  firing  sequence  a  is  a  vector  whose  ith  element  (i)  is  the  firing 

count  of  transition  t  . 

i 

Theorem  3.2.1  A  Petri  net  (P  is  consistent  if  and  only  if  there  exists 
an  initial  marking  M  for  which  there  exists  a  cyclic  firing  sequence. 


Proof ;  Necessity 

Consider  the  minimal  integer  consistent  current  assignment  derived  from 
the  given  current  assignment.  Let  this  current  assignment  be  4  .  We  will 
show  how  to  construct  a  finite  firing  sequence  o  whose  firing  vector  f 
is  such  that  tp(i)  i 8  the  current  through  transition  t^.  We  construct 
the  marking  M  as  follows:  M(pk)  must  be  big  enough  so  that  firing  its 
output  transitions  t^  e  i{i(i)  times  does  not  drive  the  token  load  of 
Pk  negative,  i.e.,  MCp^)  =  £*.  M*)  • 

tie  Pk 


♦ 


Under  this  marking,  a  cyclic  firing  sequence  a  is  given  by 


♦  (1) 


'P  (n) 


...  t 


where  t  ...t  are  the  transitions  in  the  Petri  net  .  Since  $  is  a 
1  n 

consistent  current  assignment,  the  firing  sequence  o  is  such  that  for  any 

place  p  ,  the  number  of  tokens  removed  by  a  is  equal  to  the  number  of 
k 

tokens  added  by  a  to  p^.  The  marking  M’  after  a  has  occurred  is  the 
same  as  the  marking  M  before  the  occurrence  of  o  ,  so  that  a  is  a  cyclic 


firing  sequence. 
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Sufflciency  Let  a  be  a  cyclic  firing  sequence  for  the  net,  and  let 
Y  be  the  firing  vector  of  a  .  With  each  transition  t±,  associate  an 
integer  valued  current  equal  to  ^(i).  definition  of  current,  each 
input  and  output  arc  of  t  has  a  current  \|t  (i)  associated  with  it.  Under 
this  assignment,  every  transition  has  been  assigned  a  non-zero  integer 
current,  and  each  transition  has  been  assigned  a  unique  current.  Now 
consider  any  place  p  .  By  the  definition  of  a  cyclic  firing  seqaence, 
has  the  same  number  of  tokens  before  and  after  the  cyclic  firing  seauence 
has  occurred.  This  implies  that  the  sum  of  the  tokens  entering  place  p^ 
is  the  same  as  the  sum  of  the  tokens  leaving  p^,  i.e.,  the  sum  of  the 
input  currents  equals  the  sum  of  the  output  currents  for  every  place  . 
This  is  the  definition  of  a  consistent  current  assignment. 

■ 

We  have  pointed  out  in  Chapter  2  that  LB  Petri  nets  are  the  only  ones 
of  interest  to  us.  We  now  establish  the  connection  between  consistency 

and  LB  Petri  nets. 

Theorem  3.2.2  A  Petri  net  9  which  has  a  live,  bounded  marking  M  is 
consistent . 

Proof  We  have  to  show  the  existence  of  a  cyclic  firing  sequence  which 
fires  every  transition  at  least  once.  Since  the  marking  M  is  bounded,  its 
reachability  diagram  is  a  finite-state  machine.  Also,  since  M  is  live, 
there  must  exist  a  strongly-connected  portion  of  the  reachability  diagram 
that  contains  every  transition  name  at  least  once.  Let  M  be  some  node 
(marking)  in  the  strongly-connected  portion  of  the  reachability  diagram. 
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We  know  that  in  a  strongly-connected  graph,  there  exists  a  chain  (i.e., 
a  sequence  of  directed  arcs,  all  arcs  directed  in  the  same  sense)  that 
starts  and  ends  at  M'  and  which  goes  through  every  arc  once.  This  chain 
can  be  used  to  describe  a  cyclic  firing  sequence,  and  since  *P  has  a 
live  marking,  every  transition  name  must  be  included  at  least  once  in 
this  firing  sequence. 

As  a  Corollary  to  Theorems  3.2.1  and  3.2.2,  we  have  the  following: 
Corollary  3.2.1  If  a  Petri  net  (Pis  LB,  then  there  exists  an  initial 
marking  M  for  which  there  exists  a  cyclic  firing  sequence. 

Proof  Follows  by  combining  Theorems  3.2.1  and  3.2.2. 

For  completeness,  we  mention  that  the  converse  of  Theorem  3.2.2  is  not 
true.  The  net  in  Figure  3.2.3  provides  the  necessary  counter-example.  The 
reader  can  verify  that  any  initial  marking  of  the  net  can  lead  to  a  mar¬ 
king  like  the  one  shown  in  Figure  3.2.3(b),  at  which  no  transition  is 
enabled . 

We  would  like  to  consider  the  connection  between  consistency  and  the 
structure  of  LB  SMD  Petri  nets,  because  that  is  the  class  of  Petri  nets 
we  will  use  for  modelling  asynchronous  systems.  We  will  look  at  the 
concepts  of  cyclic  firing  sequence  and  consistency  for  SMD  Petri  nets 
in  some  depth  in  Section  3.5.  Before  we  can  do  this,  we  look  at  a  det¬ 
erministic  subclass  of  Petri  nets  known  as  event  graphs.  We  do  this  in 
Section  3.3.  In  Section  3. A,  we  complete  our  study  of  those  aspects  of 
Petri  net  theory  which  are  relevant  to  this  thesis. 
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3  ♦  3  Event  Graphs 

Event  graphs  are  what  Commoner  and  Holt  have  called  Marked  GraphsfCl]. 
We  have  chosen  to  call  them  event  graphs  because  we  would  like  to  pre¬ 
serve  the  distinction  between  the  structure  and  marking  of  a  Petri  net. 

In  this  Section,  we  would  like  to  define  event  graphs  and  to  state 
some  results  that  we  will  need  in  Chapter  4,  where  we  look  at  the  idea 
of  timing  relationships  between  activities  in  deterministic  systems. 
Definition  3.3.1  An  event  graph  is  an  SMD  Petri  net  in  which  every 
place  has  exactly  one  input  transition  and  exactly  one  output  transition. 

Example  of  an  event  graph 


Figure  3.3.1  An  event  graph 

Recall  the  definition  of  conflict  given  in  Section  2.2.  It  should  be 
noted  that  no  transitions  in  the  net  are  in  confliet.  Every  marking  for 
an  event  graph  is  persistent,  and  every  event  graph  is  a  deterministic 
Petri  net.  The  structure  of  event  graphs  enables  us  to  define  them  in 
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the  following  convenient  fashion: 

Definition  3.3.2  An  event  graph  is  a  strongly-connected  directed  graph 


G  =  <V,A> 

where  V  =  fv, ,...v  }  is  the  set  of  transitions 
1  m 

A  =  {a  }  CVXV  is  the  set  of  arcs ,  \rtiere  arc  a  conne- 
i  j  J 

cts  transition  v^  to  v^. 


A  marking  of  an  event  graph  is  a  function  that  assigns  to  every  arc  a^ 
in  the  net  a  non-negative  integer  called  the  token  load  of  the  arc.  Note 
that  in  this  definition  of  event  graphs,  an  arc  corresponds  to  a  place 
together  with  its  incoming  and  outgoing  arc  in  Definition  3.3.1  (see 
Figure  3.3.2). 


I — o — * 


An  arc  in  Definition  3.3.2  Corresponding  structure  in 

Definition  3.3.1. 


Figure  3.3.2 

Figure  3.3.3  givos  an  event  graph  corresponding  to  the  one  in  Figure 
3.3.1. 

Event  graphs  are  seen  to  be  conflict-free  SMD  Petri  nets  in  which 


each  simple  circuit  is  a  state-machine  component. 

Theorem  3.3.1  (due  to  Commoner  and  Genrich) 

A  marking  for  an  event  graph  G  is  live  if  and  only  if  the  token  content 
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of  every  simple  circuit  is  non-zero. 

Proof 

Necessity  If  the  token  content  of  some  simple  circuit  is  zero,  no  tran¬ 
sition  in  this  circuit  can  be  fired;  since  the  token  content  does  not 
change  if  other  vertices  are  fired  (lemma  3.1.1),  no  vertex  on  this  cir¬ 
cuit  can  be  enabled  through  transition  firings. 

Sufficiency  Assume  that  the  token  content  of  every  simple  circuit  is 
non-zero.  Let  v^  be  any  transition  in  the  net.  Consider  the  unmarked 
arcs  entering  v^. 


Figure  3.3.4 

If  there  are  none,  the  vertex  is  enabled.  If  not,  consider  the  vertices 
from  which  these  arcs  emanate.  If  each  of  these  is  enabled,  then  clearly 
v^  will  become  enabled  after  all  of  them  are  fired.  If  some  are  not, 
consider  the  unmarked  arcs  entering  them,  etc.  As  we  continue  this 
back-tracking,  we  are  selecting  a  subgraph  of  G  which  consists  of  v^,  the 
vertices  from  which  these  arcs  emanate,  the  unmarked  arcs  entering  them, 
etc.  This  process  must  terminate,  since  G  is  finite.  Now  this  subgraph 
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must  be  circuit-free  (i.e.,  a  tree,  as  shewn  in  Figure  3.2.4),  since 
there  are  no  token-free  simple  circuits.  Thus,  the  subgraph  must  have  at 
least  one  transition  that  has  no  incoming  arcs  which  belong  to  the  sub¬ 
graph.  This  vertex  can  be  fired  at  the  present  marking  of  G.  After  firing 
it,  the  subgraph  of  token-free  back-tracking  from  v^  is  reduced  by  one 
transition.  By  repeating  this  process,  we  can  enable  v^. 

■ 

There  is  one  more  result  we  will  need  in  connection  with  event  graphs, 
the  proof  of  which  is  given  in  [ Cl ] . 

Theorem  3.3.2  A  live  marking  for  an  event  graph  is  safe  if  and  only  if 
every  arc  is  contained  in  some  simple  circuit  containing  exactly  one 
token. 

Example  Figure  3.2.5  gives  an  example  of  an  event  graph  with  a  live, 
safe  marking.  Note  that  every  circuit  has  at  least  one  token  on  it,  and 
that  every  arc  is  contained  in  some  one-token  circuit. 


Figure  3.3.5 
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It  is  easy  to  see  that  event  graphs  have  a  live,  safe  marking  that  is 
persistent,  and  are  thus  deterministic  nets.  However,  event  graphs  are 
not  the  only  Petri  nets  with  this  property.  In  the  next  Section,  we 
study  a  larger  class  of  deterministic  Petri  nets. 
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- LSP  Petri  nets  (Petri  nets  with  a  live,  safe  and  persistent  marking) 

The  class  of  Petri  nets  jS  a  large  one,  but  we  will  confine  our  atten¬ 
tion  to  LSP  Petri  nets  because  their  behavior  can  be  reoresented  by  Petri 
nets  whose  structure  is  very  similar  to  that  of  event  graphs.  The  mark¬ 
ed  Petri  net  in  Figure  3.4.1  is  a  good  example  of  a  Petri  net  which  has 
a  live,  safe  and  persistent  marking  but  which  is  not  an  event  graph.  The 
reader  can  verify  that  this  net  is  LSP  by  drawing  its  reachability  dia¬ 
gram,  which  we  give  in  Figure  3.4.2.  It  is  seen  that  in  no  marking  in  the 
forward  marking  class  can  there  be  more  than  one  token  on  any  place.  Fur¬ 
thermore,  at  each  marking  exactly  one  transition  is  seen  to  be  enabled. 
This  verifies  our  claim  that  the  net  is  LSP.  In  general,  one  can  determine 
if  a  m irked  Petri  net  is  LSP  by  using  the  results  of  Section  2.3.  As  we 
have  pointed  out  before,  the  reachability  diagram  of  a  marked  Petri  net 
obscures  the  concurrency  that  is  inherent  in  the  system  it  represents,  and 
this  is  precisely  what  happened  when  we  drew  the  reachability  diagram  for 
the  net  in  Figure  3.4.1.  We  now  present  an  alternative  means  of  represen¬ 
ting  the  operation  of  LSP  Petri  nets.  We  will  do  this  by  constructing  for 
LSP  Petri  nets  a  graph  known  as  a  behavior  graph,  and  shall  explain  its 
construction  by  means  of  an  example. 


Representing  the  Behavior  of  LSP  Petri  nets:  Consider  the  LSP  Petri  net 

^  shown  in  Figure  3.4.3(a).  We  begin  by  drawing  and  labelling  the  set 

P1  -  p  of  marked  places  in  9  , (in  this  case  {p  ,  p  ,  p j  ) .  Let 

=  ^1,*'2^  ^*e  set  of  enabled  transitions  corresponding  to  P  ,  and 

note  that  P^  corresponds  to  the  initial  marking  of  &  .  Let  ?2  be  the  set 

of  marked  places  that  results  when  all  transitions  in  T^  are  fired.  Draw 
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all  arcs  P  X  T  that  are  contained  in  P  *  T .  Draw  the  places  P'  =  T'*. 

1  1  1  1 

Draw  all  the  arcs  x  Pj  which  are  contained  in  T  X  P. 

Define  =  (P  -‘t  )  X  . 

This  process  of  constructing  the  marked  places  P  that  results  when 
all  enabled  transitions  for  P^  are  fired  is  called  extending  the  behavior 
graph  from  P^  to  P^+i  *  Since  P  has  a  live  marking,  its  behavior 
graph  can  be  extended  indefinitely.  Figure  3.4.3(b)  shows  the  behavior 
graph  of  ?  .  We  now  make  some  definitions. 

Definition  3.4.1  A  chain  in  a  behavior  graph  is  any  directed  path  in 
it. 

Example  p^t^p^t^p^t^. . .  is  a  c^a:i-n  in 

Definition  3.4.2  A  slice  of  a  behavior  graph  ®  is  a  set  of  places 
that  forms  a  cut-set  of  0 . 

Note  In  a  cut-set,  no  two  elements  of  the  set  can  belong  to  the  same 
chain . 

Example  P  =  {p^,  p^,  p^\  is  a  slice  of 

Each  slice  of  a  behavior  graph  corresponds  to  a  marking  of  the  LSP  Petri 
net. 

Lemma  3.4.1  Each  place  in  a  behavior  graph  for  an  LSP  Petri  net  must 
have  exactly  one  input  transition  and  exactly  one  output  transition. 

Proof  Suppose  some  place  p  in  the  behavior  graph  S  has  more  than  one  input 
transitio" .  Then,  there  must  exist  a  marking  M'  in  the  marking  class  of 
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the  net  Pin  which  more  than  one  token  is  placed  on  p^,  implying  that  P 
has  a  marking  that  is  not  safe.  Hence,  every  place  in  £  has  exactly  one 
input  transition. 

Also,  suppose  some  place  p  in  P  has  more  than  one  output  transition. 
This  implies  that  there  exists  a  marking  M'  in  the  forward  marking  class 
for  P  at  which  more  than  one  output  transition  is  enabled.  This  violates 
the  assumption  that  P  has  a  persistent  marking.  Hence,  each  place  in  P 
has  exactly  one  input  and  exactly  one  output  transition. 

Lemma  3.4.2  There  exists  a  slice  in  the  behavior  graph  £  of  an  LSP 
Petri  net  P  that  occurs  repeatedly. 

Proof  Each  slice  of  £  corresponds  to  a  marking  cf  P  .  Since  P  has  a 
safe  marking,  the  number  of  distinct  markings  in  the  marking  class  for 
the  net  is  finite.  Therefore,  since  the  behavior  graph  is  infinite, 
there  must  exist  a  slice  in  £  that  occurs  repeatedly. 


The  reader  is  now  asked  to  refer  to  Figure  3.4.3.  The  slice  P^  repre¬ 
sents  the  initial  marking  of  P.  P^^  does  not  occur  again  in  ft  .  A 
behavior  graph  that  has  slices  which  do  not  repeat  is  said  to  have  a 
transient,  and  such  slices  are  termed  transient  slices .  In  te  rms  of  the 
Petri  net  P  ,  a  transient  slice  represents  a  marking  that  cannot  be  -ca¬ 
ched  after  an  initial  occurrence.  The  slice  P^  (shown  dotted)  is  a  slice 
which  has  repeated  occurrences  in  £  .  A  cyclic  frustrum  of  ft  is  the  por¬ 
tion  of  'S  b  etween  two  consecutive  occurrences  of  some  repeated  slice.  In 
Figure  3.4.3(c),  we  show  a  cyclic  frustrum  of  Abounded  by  consecutive 

occurrences  of  the  slice  jp  ,  p,,  p  \  in  ft  .  Since  ft  is  derived  from 

1  4 

a  persistent  marking  of  ft  ,  only  one  way  is  possible  of  extending  this 
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t0  ^P^>  p4’  P6^  *  general»  for  any  slice  of  the  behavior  graph, 

there  exists  a  unique  extension  P  .  Thus,  every  cyclic  frustrum  bounded 

k+1 

at  each  end  by  the  slice  *p  ,  P^»  P^l  Is  identical  to  every  other  cyclic 
frustrum  bounded  at  each  end  by  this  slice.  Hence,  instead  of  drawing  an 
infinite  behavior  graph  'Ji  ,  we  will  choose  some  cyclic  frustrum  of  fb  and 
then  coalesce  corresponding  slices  together  in  the  initial  and  terminal 
slice  of  the  cyclic  frustrum.  The  net  so  obtained  is  termed  the  steady- 
state  equivalent  net  of  P  ,  and  is  shown  in  Figure  3.4.3(d).  The  marking 
M  for  the  steady— state  equivalent  net  $  is  obtained  by  putting  one 
token  on  each  place  in  the  initial  slice.  The  construction  of  this  net 
is  such  that  the  set  of  firing  sequences  of  &  is  identical  to  that  of 
the  net  9  .  Also,  the  reachability  diagram  of  J  is  contained  in  that 
for  9  ,  there  possibly  being  some  extra  states  in  ^corresponding  to  the 
transient.  The  reader  can  see  this  from  the  graph  9  ,  and  it  is  not 
necessary  to  construct  the  reachability  diagram  of  P  . 

We  have  shown  that  in  9) ,  and  hence  in  £  ,  each  place  has  exactly 
one  input  transition  and  exactly  one  output  transition.  thus  has  the 
structure  of  an  event  graph,  with  the  difference  that  certain  place  and 
transition  names  occur  more  than  once  in  it.  Transition  t^  and  places 
P3  a°d  occur  twice,  for  example.  Such  an  event  graph  is  termed  a 
multiply-_labelled  event  graph .  We  will  not  define  multiply-labelled  event 
graphs  formally,  but  will  merely  say  that  they  are  event  graphs  in  which 
certain  places  and  transitions  have  repeated  occurrences  (or  instances) . 

The  LSP  Petri  net  9  has  the  property  that  it  is  possible  to  add  tokens 
to  certain  places  and  still  have  a  marking  that  is  live  and  bounded,  but 
no  longer  safe.  Tokens  can  be  added  to  all  places  except  p^  and  p  ,  such 
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that  che  resulting  marking  is  live,  bounded  and  persistent.  The  steady 
state  equivalent  net  for  9  with  this  new  marking  is  simply  the  net  £  ; 
the  marking  for  is  constructed  by  adding  the  same  number  of  tokens  to 
a  place  in  S  as  were  added  to  the  corresponding  place  in  9  .  If  there  are 
multiple  instances  of  a  place  in  $  ,  then  tokens  can  be  added  to  any  of 
those  instances,  provided  the  sum  of  the  tokens  added  to  all  instances 
of  a  place  equals  the  number  of  tokens  added  to  the  corresponding  place 
in  9  .  Figure  3.4.4(a)  shows  a  live,  bounded,  persistent  marking  for 
and  Figure  3.4.4(b)  its  corresponding  steady-state  equivalent  net  d  . 

We  should  mention  that  there  exist  Petri  nets  which  have  a  live, 
bounded  marking  but  no  live,  safe  marking.  Figure  3.4.5  shows  such  a 
Petri  net.  Thus,  the  preceding  technique  car.not  be  used  to  construct  a 
steady-state  equivalent  net. 


P1 


t 


1 


Figure  3.4.5 
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3.5  The  Structure  and  Consistency  of  SMD  Petri  nets 

3.5.1  Existence  of  a  live  marking  for  an  SMD  Petri  net 
In  Section  3.3,  we  saw  that  for  event  graphs  a  necessary  and  sufficient 
condition  for  a  marking  to  be  live  is  that  there  be  at  least  one  token  on 
each  simple  circuit.  Since  a  simple  circuit  in  an  event  graph  corresponds 
to  a  state  machine  in  an  SMD  Petri  net,  the  reader  may  be  tempted  to  ask 
if  we  can  get  a  live  marking  for  an  SMD  net  by  adding  at  least  one  token 
on  each  state  machine  component.  The  nets  given  in  Figure  2.1.9(c)  and 
2.1.10(c)  are  counter-examples  to  this  conjecture.  In  Figure  2.1.9(c),  a 
subset  of  the  transitions  in  the  net  can  never  be  fired,  whereas  in 
Figure  2.1.10(c),  no  transition  can  be  fired.  Both  are  examples  of  Petri 
nets  which  do  not  have  a  live  marking.  We  now  give  a  necessary  condition 
for  a  marking  M  for  an  SMD  Petri  net  P  to  be  live. 


Theorem  3.5.1  A  marking  M  for  an  SMD  Petri  net  is  live  only  if  the 
token  content  of  every  state  machine  is  non-zero. 

Proof  We  will  show  that  if  the  token  content  of  some  state  machine  is 
zero,  then  no  transition  in  it  can  be  fired.  Without  loss  of  generality, 
let  be  a  state  machine  component  of  *P  and  let  t^  be  a  transition  in 
it,  i.e.,  if  S^  =  <P^,  T^,  A^>  then  t^  e  T^.  By  hypothesis,  the  input 
place  p^  of  transition  t^  in  state  machine  S^  must  be  unmarked.  Also,  by 
Lemma  3.1.1,  it  must  stay  unmarked.  This  implies  that  there  is  no  marking 
M'  e  M  at  which  transition  t  can  be  fired.  Hence,  the  marking  M  is  not 
live.  Note  that  no  transition  in  S  can  be  fired  at  any  marking  M'  e  M. 


Hack[H5]  has  shown  that  a  subclass  of  SMD  Petri  nets  called  SMA  (  state 
machine  allocatable)  Petri  nets  are  similar  to  marked  graphs  in  that  any 


-87- 


marking  that  puts  at  least  one  token  on  every  state  machine  is  live.  We 
proceed  to  describe  the  structure  of  these  nets  in  what  follows.  The  ma¬ 
terial  in  Subsection  3.5.2  is  taken  from  Hack[hl,  H5 ] . 

3.5.2  State  Machine  Allocatable  Petri  nets 

Definition  3.5.1  A  state  machine  allocation  over  a  Petri  net  <\P,T,A) 
is  a  function  B:  T  — ►  P  such  that 
V  t  <r  T  B(t)  e  't. 

Informally,  this  means  that  for  each  transition  in  the  net,  we  pick 
one  of  its  input  places  and  ignore  the  others.  Given  such  a  state  machine 
allocation,  we  perform  a  reduction  by  deleting  certain  places  and  transi¬ 
tions  in  the  following  manner: 

Step  1  Delete  all  places  for  which  at  least  one  output  arc  has  been 
deleted. 

Step  2  Delete  all  transitions  that  have  all  output  places  already  deleted. 
Repeat  Steps  1  and  2  until  neither  is  applicable  anymore.  What  is  left 
over  is  the  reduced  net.  Each  step  eliminates  some  nodes  and  arcs  that  are 
not  part  of  the  reduced  net,  until  no  more  nodes  and  arcs  can  be  deleted. 

A  Petri  net  is  said  to  be  state-machine  allocatable  iff  every  state 
machine  allocation  gives  a  reduced  net  that  is  either  a  strongly-connected 
state  machine  or  a  set  of  strongly-connected  state  machines.  We  will 
abbreviate  the  last  remark  by  using  the  contraction  "SSM"  to  denote  "a 
strongly-connected  state  machine  or  a  set  of  strongly-connected  state 
machines".  In  Figure  3.5.1(a)  we  give  an  example  of  a  Petri  net  which  is 
state-machine  allocatable.  In  Figure  3.1.5(b),  we  show  two  allocation 
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reduced  SSM's.  Reduced  net  No.  1  results  from  allocating  t  to  and 
reduced  net  No.  2  results  from  allocating  t^  to  p,..  Since  each  of  the  two 
allocation  reduced  nets  is  an  SSM,  the  net  is  SMA. 

Let  us  give  another  example.  Consider  the  Petri  net  in  Figure  3.5.2. 
Since  each  of  the  state  machine  allocation  reductions  is  an  SSM,  the  net 
is  SMA. 

We  now  give  an  example  of  a  Petri  net  which  is  SMD  but  not  SMA.  Such  a 
net  is  given  in  Figure  3.5.3.  We  see  that  two  of  the  SMA  reductions  are 
SSM's  and  the  other  two  are  null;  the  net  is  not  SMA.  In  case  the  reader 
is  a  little  confused,  we  vould  like  to  point  out  that  the  property  of  a 
Petri  net  being  SMA  is  a  structural  property  of  the  net,  and  he  will  note 
that  no  mention  has  been  made  of  markings  so  far. 

The  reason  we  have  introduced  SMA  Petri  nets  is  that  they  form  the 
largest  subclass  of  SMD  Petri  nets  we  know  of  that  has  Lne  property  that 
any  marking  which  puts  at  least  one  to^en  on  each  state-machine  component 
is  live.  This  is  the  result  presented  in  the  next  Theorem.  We  present 
this  Theorem  without  proof  and  readers  interested  in  the  details  are  refe¬ 
rred  to  Hack[Hl,H5]. 

Theorem  3.5.2  (  The  SMA  Petri  net  liveness  Theorem)  A  marking  M  for 
an  SMA  Petri  net  P  is  live  iff  it  puts  at  least  one  token  on  each  state- 
machine  component  of  the  net. 


This  Theorem  is  of  interest  because  it  tells  us  how  to  construct  a  live 
marking  for  any  SMA  Petri  net-  simply  put  one  or  more  tokens  on  each  state 
machine  component  of  the  net.  The  non-SMA  Petri  net  shown  in  Figure 
3.5.3(a)  also  has  the  property  that  any  marking  that  puts  at  least  one 
token  on  every  state  machine  is  live,  as  the  reader  can  easily  verify. 
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Figurc  3.5.3(c)  shows  the  Petri  net  of  Figure  3.5.3(c)  with  a  live  mar¬ 
king.  We  present  the  following  as  an  open  problem: 

What  is  the  largest  class  of  SMD  Petri  nets  which  has  the  property 
that  any  marking  that  puts  at  least  one  token  on  every  state  machine 
component  is  live? 

We  now  turn  to  examining  the  issue  of  consistency  for  SMD  Petri  nets. 

3.5.3  Consistency  of  SMD  Petri  nets  which  have  a  live  marking 

Recall  Corollary  3.2.1,  which  we  repeat  here  for  convenience: 

Corollary  3.2.1  If  a  Petri  net  P  is  LB,  then  there  exists  an  initial 
marking  M  for  which  there  exists  a  cyclic  firing  sequence. 

In  Section  3.4,  we  established  the  connection  between  a  cyclic  firing 
sequence  and  the  steady-state  equivalent  net  for  event  graphs  and  LSP 
Petri  nets.  We  now  introduce  a  concept  similar  to  the  steady-state  equi¬ 
valent  net  in  connection  with  li^e  SMD  Petri  nets.  A  firing  sequence,  as 
we  have  pointed  out  in  Section  3.4,  expresses  an  ordering  relation  on  tra¬ 
nsition  firings  in  a  fashion  which  obscures  the  concurrency  that  is  in¬ 
herent  in  the  Petri  net.  To  preserve  this  inherent  concurrency,  we  intro¬ 
duced  the  behav’or  graph  for  LSP  Petri  nets.  We  will  now  introduce  a  more 
general  concept  to  study  the  behavior  of  SMD  Petri  nets-  the  occurrence 
graph.  This  notion  is  not  a  new  one,  having  been  studied  extensively  by 
Holt[H6].  As  we  did  for  behavior  graphs,  we  shall  illustrate  the  cons¬ 
truction  of  an  occurrence  for  an  LS  SMD  Petri  net  by  means  of  an  example. 
We  begin  with  a  live,  safe  marking  for  an  SMD  Petri  net(see  Figure  3.5.4). 


Construction  of  Occurrence  Graph 


Begin  by  drawing  every  marked  place 
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in  the  net  together  with  a  token  on  each  place  (i.e.,  places  p^  and  P^). 

’Jow,  since  a  place  may  have  more  than  one  output  transition,  we  may  have 

several  enabled  transitions  connected  to  the  same  place.  In  event  graphs 

and  LSP  Petri  nets,  we  saw  that  every  place  has  at  most  one  transition 

enabled  at  any  marking.  This  is  not  necessarily  true  for  SMD  Petri  nets. 

As  an  example,  consider  the  marked  Petri  net  of  Figure  3.5.4.  Both  the 

output  transitions  t  and  t.  of  place  p  are  seen  to  be  enabled;  since  we 

5  6  5 

can  fire  only  one  of  these  transition:  we  must  make  a  choice  between  them 
and  fire  the  one  we  choose.  We  will  extend  a  place  in  an  occurrence  graph 
by  drawing  the  output  transition  chosen  for  firing,  firing  that  transition 
and  then  drawing  the  new  marked  place (s)  that  result.  A  slice  of  an  occu¬ 
rrence  graph  is  a  set  of  places  that  forms  a  cut-set  of  the  graph.  We  can 
talk  about  extending  a  slice  in  the  same  way  as  we  did  for  behavior  graphs, 
the  difference  being  that  a  choice  may  have  to  be  made  between  enabled  tra¬ 
nsitions.  In  the  construction  of  a  behavior  graph,  there  never  occurs  a  slice 
for  which  a  choice  has  to  be  made  between  output  transitions.  Thus,  there  is 
only  one  behavior  graph  for  au  event  graph  or  an  LSP  Petri  net,  and  this 
graph  is  unique.  On  the  other  hand,  several  occurrence  graphs  may  be  possi¬ 
ble  for  LS  SMD  Petri  nets.  In  Figure  3.5.5,  we  show  two  possible  occurr¬ 
ence  graphs  for  the  net  in  Figure  3.5.4.  The  reader  will  realize  that  an 
infinite  number  of  occurrence  graphs  is  possible  for  this  net,  or,  for  that 
matter,  in  any  LB  net  with  a  non-persistent  marking. 

Now  let  us  apply  Corollary  3.2.1  to  the  occurrence  graph  for  an  LS 
SMD  Petri  net.  An  occurrence  graph  is  a  concurrent  representation  of  a 
firing  sequence  for  a  Petri  net,  and  each  slice  represents  a  marking 
of  the  net.  A  repeated  slice  thus  represents  the  repeated  occurrence  of 
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some  marking.  The  portion  of  an  occurrence  graph  between  two  consecutive 
occurrences  af  a  slice  is  termed  a  cyclic  frustrum.  and  corresponds  to  a 
cyclic  firing  sequence  for  the  marked  net.  Theorem  3.2.1  tells  us  that 
for  any  consistent  current  assignment  for  an  LB  Petri  net,  we  can  find 
a  cyclic  frustrum  in  the  occurrence  graph  of  the  net.  The  number  of 
occurrences  of  any  transition  in  the  cyclic  frustrum  equals  its  current 
in  a  consistent  current  assignment.  4 

Example  3.5.1  Consider  the  SMD  Petri  net  of  Figire  3.5.4.  In  Figure 
3.5.6,  we  show  this  Petri  net  with  a  minimal  integer  consistent  current 
assignment.  We  draw  the  occurrence  graph  as  discussed  earlier,  and  Figure 
3.5.7  shows  a  cyclic  frustrum  of  this  occurrence  graph,  in  which  the  multi¬ 
plicity  of  each  transition  equals  its  associated  current  in  the  consistent 
current  assignment  exhibited  in  Figure  3.5.6.  We  now  coalesce  correspon¬ 
ding  places  in  the  repeated  slic?  in  a  manner  similar  to  what  we  did  for 
behavior  graphs.  The  resulting  strongly-connected  net  is  termed  a  consis¬ 
tency  -equivalent  net  for  the  SMD  Petri  net,  abbreviated  to  "c-equiva- 
lent  net"  .  The  c-equivalent  net  for  the  cyclic  frustrum  of  Figure  3.5.7 
is  shown  in  Figure  3.5.8. 

Let  us  now  note  '  ome  facts  about  the  relationship  between  an  SMD  Petri 
net  and  its  c-equivalent  Petri  net.  Let  us  begin  by  saying  that  we  will 
consider  only  minimal  integer  consistent  current  assignments.  Note  that 
the  c-equivalent  net  of  an  SMD  Petri  net  P  is  not  unique.  Figure  3.5.9(a) 
shows  an  SMD  Petri  net  'P  with  a  minimal  integer  consistent  current  assi¬ 
gnment.  The  net  (P  has  several  c-equivalent  nets,  two  of  which  are 


shown.  We  observe  that  in  the  construction  of  a  c-equivalent  net  for  an 
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SMD  Petri  net  'P  with  a  minimal  Integer  consistent  current  assignment, 
each  state-machine  in  *P  corresponds  to  a  circuit  in  the  c-equivalent  net. 
We  formalize  this  in  the  following  Lemma. 

Lemma  3.5.1  Let  P  be  an  LS  SMD  Petri  net  with  a  minimal  integer  consis¬ 
tent  current  assignment  $  ,  and  let  €  be  a  c-equivalent  net  for  P  .  Then, 
every  state  machine  component  in  !P  corresponds  to  a  simple  circuit  in  6  . 
Proof  Let  $  be  the  cyclic  frustrum  corresponding  to  «/  .  Applying 
Theorem  3.5.1,  we  see  that  the  initial  slice  of  ?  must  contain  at  least 
one  place  from  each  state-machine  in  (P  .  Now  consider  a  chain  in  7  boun¬ 
ded  at  its  extremeties  by  two  consecutive  instances  of  some  place  p  .  In 
,  this  chain  corresponds  to  a  simple  circuit.  By  the  construction 
procedure  for  occurrence  graphs,  it  is  obvious  that  each  state  machine  in 
corresponds  to  a  chain  in  3-  .  This  proves  the  Lemma. 

Example  Consider  the  LS  SMD  Petri  net  of  Figure  3.5.6.  Each  of  the  two 
state  machines  corresponds  to  a  circuit  in  the  c-equivalent  net  shown  in 
Figure  3.5.8.  Note  that  there  are  two  circuits  in  the  c-equivalent  net 
which  do  not  correspond  to  any  state-machine  in  the  SMD  Petti  net.  An 
example  of  such  a  circuit  is  P2t4P5t ^t^t^t^t^  . 

Since  the  c-equivalent  net.  corresponds  to  a  cyclic  firing  sequence  for  an 
LB  SMD  Petri  net,  we  see  by  applying  Theorem  3.2.1  that  the  multiplicity 
of  a  transition  in  the  c-equivalent  net  must  equal  its  current  in  a  mini¬ 
mal  integer  consistent  current  assignment. 

To  recapitulate  the  main  results  of  this  Section,  we  have  the  following: 
(1)  We  hav°  introduced  the  subclass  of  SMD  Petri  nets  known  as  SMA  Petri 
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nets,  which  have  the  property  that  a  marking  is  live  if  and  only  if  each 
state  machine  contains  at  least  one  token.  This  is  the  largest  class  of 
SMD  Petri  nets  known  to  date  that  has  this  property . 

(2)  We  have  introduced  the  c-equivalent  net  of  a  Petri  net  which  has  a 
live,  bounded  marking.  For  SMD  Petri  nets,  we  have  seen  that  every  state- 
machine  corresponds  to  a  circuit  in  a  c-equivalent  net.  The  converse  is 
not  true  in  general. 

We  now  turn  to  the  issue  of  applying  Petri  net  theory  to  the  analysis 
of  asynchronous  concurrent  systems. 


CHAPTER  4 


TIMED  PETRI  NETS 

4.1  Timing  In  Petri  Nets 

In  our  discussions  so  far,  we  have  not  entered  into  any  timing  con¬ 
siderations  in  connection  with  Petri  nets.  Thus,  while  SMD  Petri  nets 
can  model  the  structure  of  asynchronous  concurrent  systems,  they  do  not 
contain  enough  information  to  be  used  for  a  study  of  issues  of  perfor¬ 
mance  of  the  type  discussed  in  Chapter  1.  In  defining  Petri  nets,  we 
made  no  assumptions  about  the  length  of  time  it  takes  for  a  transition 
to  fire.  In  real-world  systems,  activities  do  not  take  place  instan¬ 
taneously.  Every  activity  in  a  system  has  a  time  duration  which  is 
different  from  zero,  and  in  all  the  systems  we  will  model,  we  will 
make  the  added  assumption  that  all  activities  complete  in  a  finite 
amount  of  time.  In  the  Petri  nets  that  we  use  to  model  these  systems, 
we  will  assume  that  every  transition  takes  a  bounded,  non-zero  amount 
of  time  to  fire.  The  resulting  model  of  asynchronous,  concurrent  systems 
is  termed  Timed  Petri  nets ,  and  is  formalized  below. 

Definition  4.1.1  A  Timed  Petri  Net  is  a  pair  (  , Cl)  where  ^  is  a 

Petri  net  (P,T,A)  and  Cl  is  a  function  that  assigns  a  real,  non¬ 
negative  number  T ^  to  each  transition  t^  in  the  net. 

ft  J  T— — — — *  "R.  f  ®  is  the  set  of  non-negative  real 

numbers  }  . 

This  non-negativs  real  number  T  ^  =  ft^)  is  termed  the  firing  time 

of  transition  t^. 
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The  operation  of  the  net  can  now  be  assumed  to  take  place  in  real  time. 

At  any  instant  T  of  real  time,  the  net  has  a  marking  M(r),  with  the 
understanding  that  we  may  view  M  both  as  a  vector  and  a  function.  We 
denote  the  initial  marking  as  M(0).  We  write  M(T,p)  for  M(T)(p),  i.e., 
the  number  of  tokens  on  place  p  at  time  T.  A  transition  t^  is  said  to 
be  enabled  at  time  T  if  and  only  if  every  input  place  of  transition  t^ 
has  at  least  one  token  on  it,  i.e., 

M(T  »pk>  *  1  for  a11  Pk  ^  't t • 

When  transition  t^  is  enabled,  a  firing  can  be  Initiated.  When  a  firing 
is  initiated,  a  token  is  removed  from  each  input  place  of  and 

transition  t^  is  said  to  be  executing.  This  execution  phase  lasts 
for  T±  seconds,  where  is  the  firing  time  of  transition  t^.  At  the 
enl  of  this  time  duration,  the  firing  of  transition  t^  terminates,  and 
a  token  is  placed  on  each  output  place  p^  £  t^*.  This  completes  the 
firing  of  transition  t^. 

The  three  phases  of  a  transition  firing  can  be  visualized  by  imagining 
every  transition  as  consisting  of  two  transitions  and  an  intermediate 
place  as  shown  in  Figures  4.1.1.  The  firing  time  of  the  transition  t^ 
can  now  be  associated  with  the  place  in  the  following  fashion: 

When  transition  t^  initiates,  t^1  fires  instantaneously,  a  token 
Is  removed  from  each  input  place  of  t^’,  and  a  token  is  deposited  on 
place  n^.  This  token  is  held  by  place  for  a  duration  equal  to  T ^ , 
the  firing  time  of  t^.  At  the  end  of  this  interval,  transition  t^” 
fires,  corresponding  to  the  termination  of  t^. 

The  initiations  and  terminations  of  transitions  in  a  Petri  net  must 
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satisfy  the  following  balance  equation: 


Notation 


Let  p  be  a  place  in  a  Petri  net  'P  .  Let  be  the  inPut 


transitions  and  t  , ,...t.  the  output  transitions  of  p. 
jl  jm 


Let  I (t , t )  denote  the  number  of  initiations  o*  transition  t  up  to 


and  including  time  T, 


and  T(T,t)  denote  the  number  of  terminations  of  transition  t  up  to 


and  including  time  r. 


Then,  if  M(0,p)  is  the  number  of  tokens  on  place  p  at  time  zero,  and 


M(t ,p)  that  at  time  T,  the  following  must  be  satisfied: 


M(0,p)  +  T(T  ,t  .  )+. . .+  T(T,t  )  =  W(T,p)  +  I(T,t  .)  +...+  I  (T » t ,  ) 


We  will  refer  to  this  as  the 


mation.  and  will  make  use 


of  it  in  later  sections.  A  word  now  about  the  choice  of  a  f  ring  time 


I 


for  a  transition  in  a  Petri  net. 
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We  have  defined  T ^  to  be  a  non-negative  real  number,  thereby  assu¬ 
ming  that  the  duration  of  each  activity  is  fixed.  This  may  not  be  a  very 
accurate  picture  of  real-world  systems,  for  ,  in  practice,  the  execution 
time  of  an  operator  depends  upon  the  data  it  is  called  upon  to  handle. 

In  a  floating  -point  adder,  for  example,  the  add  time  will  depend  upon 
the  arguments  and  their  exponents.  Thus,  it  may  be  more  reasonable  to 
assume  that  a  transition  firing  time  is  a  random  variable  vAiose  distri¬ 
bution  can  be  represented  by  a  rectangular  distribution  of  the  form 
shown  in  Figure  4.1.2. 

Probability 


Figure  4.1.2 


We  will  return  to  a  consideration  of  statistical  firing  times  in  Section 


4.5. 


-111- 


4.2  Dynamic  Behavior  a f  Timed  Petri  Nets 

In  Chapters  2  and  3,  we  introduced  the  ideas  of  firing  sequence  and 
behavior  graph  to  characterize  the  action  of  Petri  nets.  For  timed  Petri 
nets,  we  have  postulated  the  existence  of  a  real-time  axis  against  which 
the  firing  of  transitions  can  be  noted.  At  time  zero,  the  net  has  a 
marking  M(0) .  Transitions  are  allowed  to  fire,  and  the  time  at  which 
the  firing  of  a  transition  takes  place  13  recorded  in  a  table  known  as 
a  firing  schedule. 

A  firing  schedule  for  a  timed  Petri  net  is  a  set  of  sequences  of  ini¬ 
tiation  and  termination  times  for  the  transitions  of  a  net.  The  firing 
of  a  transition  is  feasible  if  the  transition  was  enabled  at  the  ir>stant 
the  firing  was  initiated.  If  every  firing  in  a  firing  schedule  is  fea¬ 
sible,  the  firing  schedule  is  feasible.  A  firing  schedule  is  infeasible 
(or  not  feasible)  if  it  calls  for  the  initiation  or  termination  of  an 
activity  earlier  than  allowed  by  the  termination  of  other  activities. 
Figure  4.2.1  is  an  example  of  a  timed  Petri  net  and  Figure  4.2.2  gives 
a  feasible  firing  schedule  for  it. 


t 


2 


3 


Figure  4.2.1 


A  Timed  Petri  net 
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.  Number  of 
^Str  ansi  t  ion 

J Liring! 

Trans- 
itions  n. 

1 

2 

3 

4 

5 

fcl 

(0,  5) 

(12,  17) 

(24,  29) 

f'2 

(7,  10) 

(IS  22) 

(31,  34) 

t3 

(5,  7) 

(10,  12) 

(22,  24) 

(29,  31) 

Figure  4.2.2  Feasible  Firing  Schedule  for  Timed  Petri  Net  in  Figure  4.2.1. 


.Number  of 
^yfirings 

Transit. 

tions 

1 

2 

3 

4 

5 

fcl 

(0,  5) 

(12,  17) 

(24,  29) 

fc2 

(7,  10) 

(19,  22) 

« 

(30,  33) 

fc3 

(5,  7) 

(10,  12) 

(17,  19) 

(22,  24) 

(29,  31! 

Figure  4.2.3  Infeasible  Firing  Schedule  for  Timed  Petri  Net  in  Figure  4.2.1. 
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Figure  4.2.3  gives  an  infeasible  firing  schedule  for  the  same  net.  The 
starred  entry  implies  that  the  third  initiation  of  transition  t ^  takes 

i 

place  at  T  =  30;  this  is  impossible,  because  the  third  initiation  of 
t  cannot  take  place  before  the  termination  of  the  fifth  faring  of  t^, 
which  happens  at  T  =  31  units. 

Readers  will  note  that  in  the  feasible  firing  schedule  of  Figure 
4.2.2,  tne  initiations  of  transitions  t^,  t 2  and  t3  take  place  at  regu¬ 
lar  intervals.  For  example,  consecutive  initiations  of  transitions  t^ 
and  t£  occur  at  intervals  of  12  time  units.  Trans  tion  t^  behaves  in 
a  slightly  different  fashion.  We  notice  that  the  first,  third , fifth, .. . 
transition,  initiations  occur  every  12  time  units,  i.e.,  alternate 
initiations  occur  every  12  time  units. 

A  firing  schedule  with  this  property  is  termed  a  periodic  firing 
schedule.  If  all  transitions  in  the  net  can  have  consecutive  initia¬ 
tions  at  regular  intervals,  we  would  term  this  a  strongly-per iodic 
firing  schedule.  The  computat ion  rate  of  a  transition  is  the  average 
number  of  firings  of  that  transition  in  unit  time.  We  can  see  that 
transitions  t  and  1 2  have  a  computation  rate  of  once  every  12  time 
units ,  or  1/12.  Transition  t3  has  a  computation  rate  of  twice  every 
12  time  units,  or  1/6.  Hiese  computation  rates  are  the  maximum  rates 
possible  for  the  transitions. 

The  Petri  net  in  Figure  4.2.1  is  seen  to  be  an  LSP  net,  so  that  it 
represents  a  deterministic  system,  and  its  steady-state  equivalent  net 
is  the  multiply-labelled  event-graph  shown  in  Figure  4.2.4.  The 
multiplicity  of  transition  t3  is  2,  while  that  of  both  t^  and  t^  is  1. 
A  consistent  current  assignment  for  the  net  cp3  =  cp2  =  1,  cp3  = 


2. 
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Figure  4.2.4  Steady-state  equivalent  net  for  the  Petri 
net  in  Figure  4.2.1 


We  note  that  the  computation  rates  of  transitions  t^,  t ^  and  t^  are  in 
the  ratio  of  their  currents  in  a  consistent  current  assignment.  Before 
we  can  justify  this,  we  look  informally  at  the  computation  rate  of 
transitions  in  an  event  graph;  successive  generalizations  of  the  ideas 
presented  below  will  lead  us  to  all  results  of  interest  in  this  thesis, 
including  the  computation  rate  of  transitions  in  timed  SMD  Petri  nets. 

Consider  the  simple  circuit  shown  in  Figure  4.2.5,  in  which  each 
transition  t^  has  an  associated  firing  time  .  We  will  begin  by  assu¬ 

ming  that  the  circuit  has  one  token  on  it,  and  will  then  extend  the  ana¬ 
lysis  to  n  tokens.  With  one  token  on  the  circuit,  each  transition  can  fire 
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in  turn  over  and  over  again.  Suppose  that  the  circuit  is  denoted  by 
p^t^p^t^. . -Pjt  ,  and  suppose  (without  loss  of  generality)  that  initially 
the  token  is  on  place  p^.  Let  n  =  +. . .+  t  .  Then,  the  token 

fires  every  transition  in  the  circuit  in  f-  n  and  reappears  on  p^  every 
it  seconds,  assuming  that  no  time  is  allowed  to  elapse  between  a  tran¬ 
sition  being  enabled  and  being  initiated.  Under  this  assumption,  every 
transition  initiates  at  intervals  of  n  seconds,  and  «  is  the  period  of 
the  firing  schedule  for  the  circuit  that  realizes  the  maximum  computation 
rate.  The  computation  rate  is  easily  seen  to  be  p  =  1/rt.  Now  suppose  that 
the  circuit  has  n  tokens  on  it  instead  of  1.  For  the  combined  action  of 
the  n  tokens,  the  firing  rate  becomes  n/n,  or  n  .  Every  transition 


? 


T 


i 


in  the  circuit  has  a  maximum  computation  rate  given  by  this  expression. 

Let  us  now  consider  tiansitions  in  a  timed  event  graph.  We  know 
from  Theorem  3.2.1  that  in  an  event  graph  with  a  live  marking,  the  token 
content  of  every  simple  circuit  is  non-zero.  Thus,  let  the  token  content 
of  a  circuit  be  n^,  where  n^  1  0.  Also,  let  denote  the  sum  of  the 
firing  times  of  transitions  in  C^.  If  every  circuit  were  by  itself,  the 
transitions  in  circuit  would  have  a  computation  rate  of  n^  / 

However,  in  a  strongly-connected  event  graph,  the  circuits  are  inter¬ 
connected  and  intuitively  it  is  clear  that  they  will  affect  each  others 
natural  computation  rate  (i.e.,  the  computation  rate  the  transitions  in 
a  circuit  would  have  if  the  circuit  were  isolated  from  the  other  cir¬ 
cuits)  . 

Now,  without  loss  of  generality,  let  be  the  circuit  with  tb  '  sma¬ 

llest  natural  computation  rate  n  /«, .  Clearly,  all  transitions  t,,,...t,. 

n  I  11  ll 
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€  have  a  computation  rate  that  cannot  exceed  n^/rt^  (see  Figure 
4.2.6). 


Figure  4.2.6 

Extending  this  argument,  we  sea  that  if  there  exists  a  directed  path 
between  any  two  transitions  in  the  event  graph,  they  must  have  the  same 
computation  rate.  Since  every  transition  is  on  some  directed  path  from 
transitions  t^,  t^>  .  ..t^,  we  conclude  that  all  transitions  in  -  e 
event  graph  must  have  the  same  computation  rate,  which  cannot  exceed 
n^/n^.  What  this  means  is  that  in  any  timed  event  graph  (g,Q),  all 
transitions  have  the  same  computation  rate,  which  cannot  exceed 
min  n^/n^] 

where  C^,...C^  are  the  simple  circuits  of  the  graph, 
n^  is  the  token  content  of  circuit  C^,  and 

is  the  sum  of  the  firing  times  of  transitions  in  circuit  C^. 
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We  have  been  able  to  obtain, inf ormally,  a  bound  on  the  computation  rate 
of  transitions  in  a  timed  event  graph.  In  the  next  Section,  we  will 
show  that  this  bound  can  actually  be  realized. 
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4.3  Timed  Event  Graphs 


In  the  previous  section,  we  saw  informally  how  we  could  obtain  a 
bound  on  the  steady  state  computation  rate  of  a  timed  event  graph. 

Here,  we  will  show  that  the  bound  we  have  obtained  is  actually 
realizable;  furthermore,  we  will  show  that  there  exists  a  strongly 
periodic  firing  schedule  for  which  the  transitions  have  a  computation 
rate  equal  to  the  bound. 

We  will  use  the  notation  for  event  graphs  given  in  Definition  3.3.2. 

To  recapitulate,  let  ^be  a  timed  event  graph  <G,o)  where 

G  is  a  strongly-connected  event  graph  <V,A). 

Recall  that  V  =  {v,  ,v_,...v  1  is  the  set  of  transitions. 

L  i  m 

A  =  =  V  x  V  is  the  set  of  arcs, 

a^j  is  an  arc  that  connects  transition  v^  to  transition  v  ^ . 
n  is  a  function  that  assigns  to  each  transition  v^  a  firing  time  t^, 
i.e,,  Q:  V  — *  (where  H  is  the  set  of  non-negative  real  numbers). 

We  write  for  Q(v^). 

Let  M(0)  be  the  initial  marking  of  the  net.  Recall  that  M  is  used 
interchangeably  as  a  vector  and  a  function.  For  event  graphs, 

M(r,aij)  will  denote  the  number  of  tokens  on  arc  a^  at  time  r* 

Arcs  in  event  graphs  will  be  treated  in  the  same  way  as  places  in  un¬ 
restricted  Petri  nets. 

We  now  proceed  to  formalize  the  definition  of  a  feasible  firing 


schedule  for  timed  event  graphs. 
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Def Inition :  A  firing  schedule  S  for  a  timed  event  graph  ^  is  a  pair 
<Sj,  ST)  such  that 

Sx  :  T|  x  V  - ►  m 

ST  :  Tj  x  V  — ►  1R  ,  T|  -  (1,2...) 

and  ST(n,vi)  =  S^n.v^  +  T ^ 


Si(n,vi)  is  the  nth  initiation  time  of  transition  v^ 

ST(n,Vi)  is  the  nth  termination  time  of  transition 

A  firing  schedule  for  a  timed  Petri  net  may  not  be  realizable  because 
some  transitions  are  specified  to  fire  when  they  are  not  enabled.  A 
feasible  firing  schedule  is  one  that  does  not  violate  this  requirement. 
We  now  show  how  to  determine  the  marking  M(t)  at  any  instant  of  time  t 
for  a  timed  event  graph  Sj  which  has  a  feasible  firing  schedule  S. 

Notation:  At  any  instant  of  time  T  in  a  firing  schedule  S  for  timed 


event 


graph 


I  (T,v  )  denotes  the  number  of  initiations  of  transition  v^  up  to  and 
s  i 


including  time  r « 


T  (r,v  )  denotes  the  number  of  terminations  of  transition  v^  up  to  and 

S  X 


including  time  t • 


Let  a^  =  (v^v^)  be  an  arc  in  tbe  event  8raPb  G* 
By  token  balance 

H(r,a  )  =  M(0,a  )  +  Tg(r,vi)  -  IgOr.Vj). 


Now  consider  the  enabled  transition  v^  shown  in  Figure  A. 3.1.  If  an 
initiation  of  v^  takes  place  at  the  instant  t,  then  iirmediately  prior 


to  instant  T,  there  must  have  been  at  least  one  token  on  every  input 
arc  of  Vj .  We  will  use  the  notation  r"  to  denote  an  instant  (r~6)  where 
£  *"  0*  T  will,  for  all  practical  purposes,  denote  the  instant  t 

itself. 

Definition:  A  firing  schedule  S  for  a  timed  event  graph  j is  feasible 
iff  for  any  transition  v^  and  for  k  =  1,2,3 ... ,  we  have 
V  aij  €  ‘Vj,  MCSjlk.Vj]",  a  )  2  1 


Input  arcs  of 
transition  j 


Figure  4.3.1  An  instant  when  transition  v^  can  initiate. 

Suppose  we  now  consider  a  run  of  the  event  graph  J^.  Consider  an 
arc  a^j  that  initially  had  M(0,a^)  tokens  on  it.  Suppose  transition 
has  fired  n  times  up  to  and  including  an  instant  of  time  r.  It  is 
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obvious  that  in  the  same  interval  of  time,  transition  could  have 
fired  at  most  11(0,8^)  +  n  -  1  times  without  driving  the  token  content 
of  arc  a^  negative.  The  (M(0,a^)  -4-  n)th  initiation  of  transition  v^ 
must  await  the  nth  termination  of  transition  v^.  We  will  show  in  the 
next  Theorem  that  this  is  a  necessary  and  sufficient  condition  for  a 
firing  schedule  to  be  feasible. 

Theorem  4.3.1:  A  firing  schedule  S  for  a  timed  event  graph  ^  =  (G,Q) 
is  feasible  iff  for  each  arc  a^  in  the  graph  G,  and  for  n  =  1,2,3... 
ST(n,vi)  £  Sx(n  +  M(0,a^),  v  ^  ) . 

This  can  be  stated  equivalently  as 

S][(n,vi)  +  rt  *  Sj  (n  +  M(0,alj),  v  ) . 

Proof : 

Necessity,:  We  prove  this  by  contradiction.  Suppose  there  exists 
a  feasible  firing  schedule  such  that  for  some  n  and  some  a^ 

ST(n,vi)  >  Sj  (n+M(0,alj),  v^),  i.e., 

SI(n,vi)  +  T±  >  SjXn  +  MCO.a^),  v^). 

Let  t  =  Sj(n  +  MCOja^j),  v^)  and  therefore 
Is(T,Vj)  =  n  +  M(0,aij). 

Since  r  <  ST(n,vi>,  we  have  Tg(T,v.)  <  n. 


Using  token  balance, 
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M(r,aij)  =  M(0,ai j)  +  TgCr.v^  -  Is<t,  Vj) 

=  M(0,aij)  +  Tg(T,vi)  -  n  -  M(0,aij) 

=  Ts(r,vt)  -  n 
i.e.,  M(r,aij)  <  0 

This  is  the  desired  contradiction  of  feasibility  of  S. 

Sufficiency :  Suppose  S  is  a  feasible  schedule,  and  consider  any  instant 
of  time  t  >  0  such  that 

Sj.Cn  +  MCO.a^),  v^)  =  t 

We  wish  to  show  that  for  the  instant  r  just  prior  to  an  initiation  of  v 

M(T  .a^)  :>  1. 

Now,  at  time  t~ , 

MOr^.a^)  =  MCO.ajj)  +  -  (n  +  MCO.a^)  -  1) 

=Ts(t’,v1)  -  n  +  1 

but 

Ts(T"»vi)  *  n. 

Hence, 

MOf-.a^)  ^  1 

■ 

We  have  introduced  the  idea  of  a  strongly  periodic  firing  schedule  in 


Section  4.2.  We  now  give  a  formal  definition  of  a  strongly  periodic 
firing  schedule  for  a  timed  event  graph. 
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Def inition :  A  feasible  firing  schedule  S  for  a  timed  event  graph  y 
is  strongly  periodic  with  period  >  0  iff  there  exist  real  numbers 
^  0  such  that 

S].(n,vi)  «  x.  +  (n-l)Jt  n  =  1,2,... 

i  1 ,  £ , . . «  m . 

The  real  numbers  Xp...xm  are  the  displacements  of  the  firing  times  of 
transitions  from  the  instants  0,  «  f  2it,... 


Corollary  4.3. 1.1:  Let  a^  be  an  arc  in  a  timed  event  graph  Then 
has  a  strongly  periodic  firing  schedule  with  period  it  >  0  and  with 
displacements  Xp . .  .xm  iff  Va„  €  G,  we  have 
*i  +  Ti  s  +  M(0,aij). 


Proof : 


S 


I 


Theorem  4.3.1  states  that  V  €  G  and  for  n  = 
(n,vi)  +  T±  ^  S^n  +  M(0,aij),  v  ). . .  [4.1] . 


1,2,. 


By  the  definition  of  a  strongly  periodic  firing  schedule  we  have 
SI(n,vi)  *  x  +  (n  -  l)n 

and  SI(n  +  M(0,aij),  Vj)  =  xj  +  (n  +  M(0,a1j)  -  1). 

Substituting  into  inequality  4.1,  we  have 

xi  +  (n  -  1)  +  s  Xj  +  (n  +  MCO.a^)  -  1) 


or 


+  T,  s  x, 


M(0,aij) 


The  inequalities  of  Coronary  4.3.1. 1  can  be  rewritten 
Xj  -  ^  Ti  -  M(0,aij)  V  a^  e  G. 


The  quantity 
Let  us  write 

T±  ~  TTH(0,a 


-  nM(0,a1j 

ij*  =  Cij* 


) 


is  a  constant  for  each  arc  a 


ij' 


Corollary  4.3. 1.1  can  be  rewritten  as  follows: 

A  timed  event  graph  ^  has  a  strongly  periodic  firing  schedule  for 

displacements  x.,...x  iff 
1  m 

V  €  G,  we  have  Xj  -  xt  ^  c^.,.14.2] 

where  c  =  rt  “  TTH(0,aiJ). 


We  would  now  like  to  investigate  under  what  conditions  the  set  of 
inequalities  [4.2.j  ia  true.  In  order  to  do  this,  wl  make  a  brief 
excursion  into  the  theory  of  potentials  for  directed  graphs.  The 
development  given  here  follows  very  closely  the  material  in  Berge  and 
Ghouila-Houri,  pages  144-145  and  pages  155-157  [B3] . 


4.3.2  Existence  of  Potential  Differences  for  event  graphs 


Suppose  with  each  arc  a^  of  an  event  graph  G  we  associate  a  real 
number  0^  which  satisfies  the  condition  that  for  every  cycle  §  €  G, 


Ieu 

*u  €  ?* 


I6 


ij 

•ij  « 


where  denote  the  arcs  of  the  cycle  oriented  in  a  given  sense,  and  §" 
the  arcs  oriented  in  the  opposite  sen^e  (see  Figure  4.3.2).  Then,  we 


will  say  each  0^  represents  a  potential  difference  across  its  corresponding 
arc  Readers  familiar  with  circuit  theory  will  notice  the  similarity 


i 


I 

f 


I 
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a12’  a  23’  a34  6  ? 


a14  €  5 


Figure  4.3.2 


of  this  notion  of  potential  difference  to  that  used  for  electrical 


networks.  In  networks  we  can  choose  any  arbitrary  node  to  be  a 


reference  node  and  assign  it  a  potential  of  zero.  The  potential  of  any 


node  in  the  network  is  then  equal  to  the  potential  difference  between 


it  and  the  reference  node.  We  will  now  show  that  if  the  arcs  of  an 


event  graph  can  be  given  a  potential  difference  assignment,  then  it  is 


equivalent  to  saying  that  each  transition  v^  can  be  assigned  a  potential 


x,  such  that  the  potential  difference  0. ,  =  x  -  x, .  We  have  used  x. 
i  r  ij  j  i  i 


to  denote  the  potential  associated  with  transition  v^,  and  have  used  x^ 


earlier  to  denote  the  displacement  associated  with  v, .  This  has  been 


done  intentionally, 


Theorem  4.3.2:  A  function  A  — which  assigns  a  real  number  0. 


to  each  arc  a^  is  a  potential  difference  assignment  iff  there  exists 


•H  which  associates  with  each  vertex  v^  in  the  event 


I 

I 


a  function  X:  V 
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graph  a  real  number  x^  such  that  for  every  arc  a^  we  have 

etj  -  *j  •  v 

Sufficiency :  If  0^  is  defined  as  given, then  consider  a  cycle 
?  ■  (vj»  V2»  •  •  *vlc,vl^  * 

Define  5  =  +1  if  a,,  is  directed  in  one  sense 

aij 

§  -  -1  if  a.,  is  directed  in  the  opposite  sense. 

aij  lj 

^*en»  ^ai2  ®12  =  x2  ”  xi 

5  “  x, 

a23  .  23  :  3  :  2 

?M  '  Xl  '  ^ 

Sunning,  we  get 

l  hi  -  1  eu  -  °- 

*1J  €  6  ?' 

Thus,  the  assignment  0  is  a  potential  difference  assignment. 


Necessity ;  If  0  is  a  potential  difference  assignment,  let  us  define 
the  potentials  x^  step  by  step. 

Take  an  arbitrary  vertex  v^  and  assign  the  coefficient  x^  =  0  to  it. 
If  v^  has  been  labelled,  and  v^  has  not  yet  been  labelled,  and  if  a^ 
is  an  arc  in  A,  then  we  write 

*)  ~Ki +  V 

Similarly,  if  a^  is  an  arc  in  A,  then  we  write 
*i  -  *j  +  eu 
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or  Xj  = 


Since  the  event  graphs  we  are  interested  in  are  strongly  connected, 
the  potential  of  every  vertex  can  be  assigned  this  way. 

The  potential  assigned  to  a  vertex  v^  is  uniquely  defined;  for 
otherwise  there  would  exist  two  chains  C-^  and  C2  going  from  v^  to 
V£  such  that  they  form  a  cycle  around  which  the  potential  differences 
do  not  sum  to  zero  (see  Figure  4.3.3).  This  would  violate  the 
definition  of  potential  difference. 


We  now  examine  the  conditions  under  which  a  potential  difference 
assignment  exists  for  directed  graphs.  The  following  Theorem  holds 
for  all  connected  graphs  (i.e.,  not  just  strongly  connected  graphs). 


Theorem  4.3.3  :  Let  G  =  (V,A)  be  a  connected  graph,  and  let  us  associate 
with  each  arc  a^  f  A  an  interval  [k^,  l^jj,  where  s  1^.  A 
necessary  and  sufficient  condition  for  the  existence  of  a  potential 
difference  assignment 

•  =  ^  A) 

such  that 


ku  5  eij £  hj 
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1s  that  for  every  simple  cycle 


l}t  i 

Zhij 

aij  € 

*u 6 

2:  Yk. 

1}  ij 

L  ij 

aij  € 

aij  €  ? 

Proof:  (Suggested  by  Jack  Dennis;  we  disagree  with  the  proof  in  B3). 

Necessity:  The  condition  is  necessary;  for,  suppose  such  a  potential 
difference  assignment  ®  exists.  Then,  we  have,  for  a  cycle 

Su  ■  ■ 0 

*ij  «  *11  6  r 

If  we  use  the  lower  bound  of  k^  for  each  arc  in  the  first  term  and  the 
upper  bound  1  for  each  arc  in  the  second,  we  have 

S„  -  I1.!  •* 

•u  <  ’ll  *  r 

^  £kij 
aij  e  aij  G 

By  doing  the  converse,  we  get  the  second  inequality. 

Sufficiency :  Here,  we  shall  use  the  concept  that  each  interval  represents 
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a  set  of  points  on  the  real  line.  We  will  write 
,  1 i j 1  -  (x|x  ^  ^ij*  x  ^  ^ij  ) • 

We  will  use  the  concept  of  the  interval  sum  of  two  intervals  A  and  B, 
defined  as 

A  +  B  =  {a  +  b |  a  £  A  and  b  £  B}. 

Also,  the  interval  difference  of  two  intervals  A  and  B  is  defined  as 
A  -  B  =  (a  -  b |  a  €  A  and  b  £  B}. 

The  conditions  of  the  Theorem  can  be  written  as 


l1 11 

2,kij 

*  0  ...  [4.3] 

aij  6  S' 

*ij  6  5* 

and 

Zku 

-  Ilu 

s  0  ...  [4.4] 

*1J  e 

“ij  6 

Now  consider 

the  interval 

f  T* 

Vi 

yk  1 

J  -  1  ^  ij 

£  ij 

’  £  ij 

E ii  I 

aij  6  5’ 

•u  6 

aij  6  ?" 

aij  € 

From  [4.3]  and  [4.4], 

0  6  J  . 

..  [4.5]. 

But  J  can  be 

rewritten  as 

r 

Vi  ] 

-  f  ft 

Yi 

*r-J 

J 

ii 

’  Is  ij  I 

L  L  i*  ♦ 

*  L  ij 

aij  e  5* 

au  6  ? 

au  6  ? 
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-  XYu  '  £yij  [4-6! 

sij  €  r  *tJ  €  S* 

From  [4.5]  and  [4.6], 


0  € 


IY 


*ij  * 


ij 


€  t 


[A.  7] 


Thus,  the  conditions  of  the  Theorem  can  be  written  as  in  [4.7],  We  shall 

show  by  induction  that  [4.7]  implies  the  existence  of  a  potential  difference 

assignment  0  such  that  k.  £  £  1  .  This  is  true  for  a  graph  with  one 

arc;  we  shall  suppose  that  it  is  true  for  every  graph  with  (m  -  1) 

arcs  and  show  that  it  holds  for  a  graph  G  with  m  arcs  (labelled  l,...m) 

with  the  intervals  Y.,...Y  subject  to  condition  [4.7].  Let  G  be  a 

graph  with  (m  -  1)  arcs  that  is  obtained  by  deleting  arc  h  =  (v^,Vj) 

from  G,  and  let  ®  be  the  potential  difference  assignment  for  G.  Let 

6,  be  the  potential  difference  across  the  vertices  v,,v  in  the  graph  G 
h  1  J 

under  the  potential  difference  assignment  9  .  If  6^  €  Y^,  then  by  adding 
arc  h  to  G  and  assigning  it  a  potential  difference  9^  =  6^,  we  have 
constructed  for  the  graph  G  a  potential  difference  assignment 

0  =  6h)- 


If  S,  does  not  lie  within  the  limits 
h 

possibilities : 

Either  (a)  6^  <  k^ 

or  (b)  6h  >  lh« 


then  there  are  two 


We  will  assume  without  loss  of  generality  that  for  all  simple  cycles 


Figure  4.3.4. 


?  €  G  which  contain  the  arc  h,  h  €  ?  .  In  G  these  cycles  correspond  to 
simple  paths  from  to  v^.  Thus,  for  each  simple  path  S  from  to 
have 


0  € 


U  pq 

a  €  r 

pq 

a  h 

pq 


Yv 

L*  pq  + 


pq 

a  e 

pq  b 


or 


6he 


-I' 


pq 


+  n 


& 


a  €  §’ 

pq 

a  t  h 

pq 


pq 

a  6  5* 

pq  * 


We  will  refer  to  the  set  of  simple  paths  from  v.  to  v.  as  the  hammock  H. 

l  j  - 


^  =  { s | s  is  a  simple  path  from  to  v^}. 
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Case  (a)  6^  <  k^.  We  will  show  that  it  must  be  possible  to  reassign 
potential  differences  to  arcs  in  the  hammock  *•  so  that  6h  ^  1^.  We  do 
this  by  finding  a  cut  set  Cn  of  the  hammock  # (see  Figure  4.3.5). 


Figure  4.3.5.  Haranock  ft 


With  each  arc  m  =  a_  in  the  cut  set  C_,  we  associate  a  slack  9  defined  bv 

PH  n  -  m  J 

~  if  m  P 
m  m  m  ^ 

S=  0-  1  if  m  €  §“• 
m  m  m  3 

Let  =  min  ] 

l,...r  €  Cn 

We  increase  the  potential  difference  of  each  arc  m  €  by  dn  and 
decrease  the  potential  difference  0  of  each  arc  m  f  E~  by  3n 

Thus  6h  has  been  increased  by  dn.  If  6h  +  9°  >  kh,  we  are  done. 
Otherwise,  repeat  the  above  procedure  for  some  other  cut  set,  and  so  on. 
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If  no  more  cut  sets  are  found  for  which  this  potential  difference  reas¬ 
signment  can  be  done,  then  there  must  be  some  path  s  in  which  the 
following  is  true; 

'  km  T  "  « 

6m  -  l.  T  "  €  ?'• 

For  as  long  as  there  is  no  such  path,  there  must  exist  a  cut  set  whose 
arcs  do  not  have  potential  differences  equal  to  either  limit. 

Now  consider  the  cycle  consisting  of  the  path  s  and  arc  k. 

.  Yi  -  Jk  <  k 

0^  “  L-j  in  La  m  h 

m  €  §*  m  €  5“ 

m  k 

Recalling  that  k  was  defined  to  be  in  we  see  that 

Jl  -  Jk  <  ° 

Lt  m  L.  m 

m  €  5*  m  €  5“ 

which  contradicts  the  first  condition  of  the  Theorem,  namely  that 

Zhj  *  Ikij 

aij  €  ?  aij  €  ^ 

Similarly,  we  show  that  for  Case  (b),  i.e.,  Sh  >  lh,  it  must  be  possible 
to  reassign  the  potentials  in  the  hammock  H-  so  that  6,  <.  1,  .  If  not, 
we  can  show  by  a  method  similar  to  that  for  case  (a)  that  there  must 
exist  a  path  for  which 

Ihj  <  Ikij 

*ij  6  *ij  6  S' 
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which  contradicts  the  second  condition  of  the  Theorem. 
This  proves  the  Theorem. 


Corollary  4.3.3. 1 

A  potential  difference  assignment  &  such  that  0^  ^  (for  all 
a  6  G)  exists  if  and  only  if,  for  every  circuit  §, 

hi 

e? 


Proof:  Set  1^  «  +  ®  for  every  a^.  There  are  two  types  of  cycles: 

(a)  circuits,  i.e.,  cycles  in  which  all  arcs  are  oriented  in  the  same 
direction. 

(b)  cycles  which  are  not  circuits. 

For  every  circuit  we  see  that 


■u  € 

For  cycles  which  are  not  circuits,  we  see  that 


which  is  vacuously  true  if  all  k 


ij 


's  are  finite. 


We  are  now  in  a  position  to  show  that  the  bound  we  obtained  on  the 
computation  rate  of  transitions  in  a  timed  event  graph  is  actually 
attainable.  This  forms  the  subject  of  our  next  theorem. 
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Theorem  4.3.4:  Let  j  -  (G,n)  be  a  timed  event  graph.  J  has  a  strongly 
periodic  firing  schedule  with  period  n  >  0  iff 

K  c 

tt  ^  max  vi 


V  k 


*u  e  ck 

where  . .  .C^)  is  the  set  of  all  circuits  in  G. 


Proof:  prom  Corollary  4.3. 1.1  we  have  the  result  that  a  timed  event 
graph  has  a  strongly  periodic  firing  schedule  with  displacements 
Xl*...Xm  an<*  Peri°d  17  iff 


V  3ij  ^  G*  we  have  Xj  ~  xi  21  cij 


where  c^  =  T ±  -  TrM(0,a  ). 


[4.2] 


ij 

'.'low  with  each  arc  a  we  associate  0  =  x  -  x  . 

J  ij  j  i 

By  Theorem  4.3.2,  a  function  that  assigns  such  a  0  to  each  arc  a 

J  ij 

must  be  a  potential  difference  assignment.  The  set  of  inequalities  [4.2] 
now  becomes 

«  XT  —  M 

[4.8] 

By  Corollary  4.3.3. 1,  this  system  of  inequalities  has  a  solution  iff, 
for  every  circuit  §  €  G, 


9ij  *  cij  V  aij  €  G 


£ 


ij 

aij  €? 


<:  0 


[4.9] 


[4.9]  now  becomes 
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V  ?  €  G,  Z(fi  "  n  M<°»aij))  £  0 

aij  *  F> 

or  h  ‘  wIM<0»a1j)  50  V?€g, 

vi  6  §  aij  6  ? 

or  TT  2:  Zi 

Vi  €  £ _  V  §  6  G 

frMij) 

aij  €? 

This  is  true  iff 

[4.10] 

alJ  «  ? 


Comnents  on  Theorem  4.3.4;  We  note  that  there  exists  a  strongly  periodic 
firing  schedule  with  period 
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The  computation  rate  p  of  transitions  in  G  is  given  by  p 


Hence, 

P  *  O' 


§ 


min 
€  G 


3 


1 

TT 


[4.11] 


It  is  clear  from  the  inequality  [4.  i]  that  there  exists  a  strongly 
periodic  firing  schedule  which  realizes  a  computation  rate 
P  =P' 

Thus,  the  timed  event  graph  can  be  allowed  to  "run"  at  any  "speed"  less 
than  or  equal  to  p\  In  Section  4.2,  we  argued  that  the  computation 
rate  of  transitions  in  a  timed  event  graph  could  not  exceed  that  of 
transitions  in  the  circuit  with  the  minimum  ratio  of 


) 


!ull 


h 


vi 6  5 


but  our  arguments  were  non-rigorous.  In  this  section  we  have  sub¬ 
stantiated  our  claim  in  a  rigorous  fashion.  The  circuit  with  the 
minimum  ratio  of  token  content  to  sum  of  transition  firing  times  is 
termed  the  critical  circuit,  and  as  we  have  seen,  this  is  the  one 
which  determines  the  maximum  computation  rate  of  all  transitions  in 


the  net. 
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Sectlon  4.4  Timed  LSP  Nets 

We  now  proceed  to  apply  the  result  obtained  in  the  previous  Section 
to  determining  the  maximum  computation  rate  of  transitions  in  a  timed 
LSP  Petri  net.  Our  starting  point  will  be  to  decide  if  a  given  marked 
Petri  net  is  indeed  LSP.  This  can  be  done  by  applying  the  results  deve¬ 
loped  in  Section  2.3. 

Consider  a  timed  LSP  Petri  net  £  =  The  timed  Petri  net 

in  Figure  4.2.1  is  an  LSP  net,  and  we  saw  that  we  were  able  to  constr¬ 
uct  a  feasible  periodic  schedule  for  it.  We  now  show  that  every  LSP 
Petri  net  has  a  feasible  periodic  firing  schedule,  and  we  show  how  to 
find  the  period  and  the  cofiputation  rate  of  every  transition  in  the 
net . 

Recall  from  Section  3.4  that  the  steady  state  behavior  of  an  LSP 
net  *P  can  be  represented  by  the  steady-state  equivalent  net  J  . 
Furthermore,  recall  that  J  has  the  structure  of  a  multiply-labelled 
event  graph.  The  only  difference  between  event-graphs  and  multiply- 
labelled  event-graphs  is  that  that  the  latter  may  have  multiple  ins¬ 
tances  of  certain  transitions.  We  will  begin  by  assuming  that  all 
transitions  in  a  multiply-la Jelled  event  graph  have  distinct  labels, 
and  the  steady-state  equivalent  net  i  can  be  regarded  as  an  event  graph. 
Each  transition  is  assigned  the  firing  time  of  the  transition  it  corres¬ 
ponds  to  before  the  relabelling  was  dont.  The  computation  rate  of  trans¬ 
it!  Dns  in  the  resulting  event  graph  can  be  found  by  applying  the 
result  of  [4.11]. 

We  showed  in  Chapter  3  that  the  steady  state  equivalent  net  of  a 
consistent  LSP  Fe  .ri  net  must  have  a  number  of  occurrences  of  each 


transition  equal  to  its  current  in  a  consistent  current  assignment.  We 
now  show  that  the  computation  rate  of  transitions  in  a  timed  LSP  Petri 
net  must  be  proportional  to  their  currents  in  a  consistent  current 
assignment.  To  show  this,  let  us  consider  the  steady-state  equivalent 
net  J  .  We  have  seen  that  we  can  find  the  computation  rate  of  tran¬ 
sitions  in  the  relabelled  net  by  treating  it  as  an  event  graph.  We 
will  call  this  the  fundamental  computation  rate  of  the  LSP  net,  and  will 
denote  it  by  p.  If  the  multiplicity  of  any  transition  v^  is  p(v^),  it 
means  that  in  unit  time,  transition  v^  in  the  timed  LSP  net  £  fires 
p(v^)  X  p  times,  because  each  firing  of  an  instance  of  transition  of 
transition  v^  in  J  is  also  a  firing  of  transition  v^  in  ^ .  Hence, 
we  can  find  the  computation  rate  of  any  transition  v^  by  sipply  multi¬ 
plying  p  by  the  multiplicity  of  that  transition  in  J. 

Summarizing,  the  following  steps  are  are  involved  in  finding  the 
computation  rate  of  transitions  in  a  timed  LSP  Petri  net  jC  : 

Step  1  Find  the  steady  state  equivalent  net  A  of  the  underlying  LSP 
Petri  net  9  . 

Step  2  Treating  &  as  an  event  graph,  find  its  maximum  computation 

rate  p  by  applying  [4.11].  This  gives  the  maximum  fundamental 
computation  rate  of  the  timed  LSP  Petri  net  X  . 

Step  3  If  p^  is  the  maximum  computation  rate  of  transition  v^  in  £  , 
and  p(v^)  is  its  multiplicity  in  i  ,  then  p^  =  p  X  ^(v^). 

We  illustrate  this  method  by  an  example. 

Example  Consider  the  timed  LSP  Petri  net  of  Figure  4.2.1  (reproduced 
in  Figure  4.4.1).  In  order  to  make  our  example  as  general  as  possible, 


-142- 


the  net  has  a  marking  M  that  is  not  safe,  but  which  is  live,  bounded 
and  persistent.  In  Figure  4.4.2,  we  show  the  steady  state  equivalent 
J  of  the  net  ?  with  marking  M.  J  can  be  decomposed  into  four  circui¬ 
ts  : 


C1!  Vi'AVzVA 

C2  *  p3tlp2t2p3 
C3*  pltlp2t2p4t3pl 
C4=  p3tlp4t3plt2p3 

The  fundamental  computation  rate  p  is  seen  to  be 
p  =  min[  2/12,  1/8,  2/10,  1/10] 
or  p  =  1/10. 

Now,  M-Ctj)  =  1 
P(t2)  =  1 

P(C3>  '  2 

Thus  the  computation  rates  of  transitions  tj,^  and  t3  are  as  follows: 
P1  =  1/10 
P2  =  1/10 
p3  =  2/10. 


A  note  on  the  Structure  of  LSP  Petri  nets 

Although  it  is  true  that  the  LSP  Petri  nets  that  arise  in  the  course 
of  modelling  practical  systems  will  all  be  SMD  there  do  exist  Petri  nets 
which  are  LSP  but  not  SMD.  The  net  in  Figure  4.4.3  is  an  LSP  Petri  net, 
but  it  is  not  SMD  as  the  reader  can  easily  verify.  The  method  we  have 
given  for  finding  the  computation  rate  for  timed  LSP  Petri  nets  holds 
for  all  LBP  Petri  nets  which  have  a  live,  safe  marking. 


-144- 


4.5  Timed  Petri  nets  with  random  firing  times 

The  assumption  of  deterministic  firing  times  for  Petri  nets  has  given 
us  a  great  deal  of  insight  into  the  action  of  practical  asynchronous  sys¬ 
tems.  We  have  been  able  to  find  the  computation  rate  of  transitions  in 
timed  event  graphs  and  LSP  Petri  nets,  and  in  each  case  we  have  been  able 
to  identify  system  components  that  limit  the  fundamental  computation  rate 
of  the  net  (i.e.,  the  critical  circuit).  The  critical  circuit  represents 
a  bottleneck  in  the  system  being  modelled,  ami  readers  familiar  with 
PERT  networks  will  see  the  correspondence  between  critical  circuits  in 
timed  event  graphs  and  critical  paths  in  PERT  networks. 

The  assumption  of  deterministic  firing  times  may  not  be  sufficiently 
accurate  for  practical  systems.  In  practice,  an  action  in  an  asynchronous 
system  like  an  addition  will  be  a  random  variable  ,  and  in  a  large  number 
of  cases  this  can  be  approximated  by  either  a  rectangular  or  a  Gaussian 
distribution  (see  Figures  4.5.1  and  4.5.2  respectively).  A  Gaussian 
distribution  has  the  problem  that  it  has  a  "tail"  which  extends  for  nega¬ 
tive  values  of  time.  Since  we  cannot  visualize  an  action  taking  less  than 
zero  time  to  occur,  we  would  prefer  to  use  the  rectangular  distribution 
shown  in  Figure  4.5.1. 

It  is  possible  to  find  out  the  mean  computation  rate  of  a  timed  event 
graph  when  all  firing  times  are  given  by  a  distribution  like  the  rectan¬ 
gular  distribution.  However,  the  process  is  very  tedious,  and  we  will 
work  with  the  means  of  each  of  the  firing  times.  By  doing  this,  we  get 
a  timed  event  graph  whose  computation  rate  can  be  found  by  applying  Equa¬ 
tion  [4.11].  It  can  be  shown  by  using  the  results  of  Clark[C3]  that  the 
computation  rate  so  obtained  is  an  upper  bound  on  the  true  computation 
rate  of  the  timed  event  graph  whose  firing  times  are  random  variables. 
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The  deviation  of  the  true  mean  computation  rate  from  this  bound  depends 
upon  on  the  standard  deviations  of  the  random  variables,  but  there  does 
not  appear  to  be  any  simple  expression  that  relates  these  quantities. 

We  now  turn  to  Chapter  5,  where  we  find  a  bound  on  the  computation 
rate  of  timed  SMD  Petri  nets. 


CHAPTER  5 


COMPUTATION  RATE  OF  ASYNCHRONOUS  SYSTEMS  WITH  DECISIONS 

5.1  Timed  SMD  Petri  Nets 

In  Chapter  3,  we  Illustrated  the  power  of  SMD  Petri  nets  in  model¬ 
ling  asynchronous  systems  with  decisions.  We  also  established  two 
important  results  about  SMD  Petri  nets: 

(a)  It  is  decidable  if  any  given  marking  for  an  SMD  Petri  net  is 
live . 

(b)  For  SMA  Petri  nets,  any  marking  which  puts  at  least  one  token 
on  every  state  machine  is  live. 

Thus,  using  the  results  presented  in  Chapter  3,  we  can  design  asyn¬ 
chronous  processing  systems  for  possible  implementation  by  the  tech¬ 
niques  developed  by  Dennis  and  Patil  [Dl,  D2]  .  We  now  show  how  we  can 
estimate  the  computation  rate  of  this  more  general  class  of  systems. 

Recall  that  in  Chapter  4  we  developed  a  technique  for  finding  the 
maximum  computation  rate  of  a  large  class  of  deterministic  systems. 

In  this  chapter,  we  look  at  non-determinist ic  systems,  and  see  if  we  can 
obtain  a  bound  on  their  computation  rate.  We  do  this  by  finding  a 
bound  on  the  computation  rate  of  transitions  in  a  timed  SMD  Petri  net. 

We  have  defined  timed  Petri  nets  in  Section  4.1  (see  definition  4.1.1). 
Thus,  a  timed  SMD  Petri  net  is  an  SMD  net  in  which  every  transition  has 
a  fixed,  non-zero  firing  time.  For  details  of  the  firing  mechanism 
and  notation  for  the  instantaneous  marking  of  a  timed  SMD  net,  the 
reader  is  urged  to  re-read  Section  4.1. 


-146- 


-148- 


Let  us  now  consider  a  timed  SMD  Petri  net  and  examine  its  be¬ 
havior.  We  do  this  by  constructing  a  firing  schedule  for  it,  and 
interpreting  the  notion  of  computation  rate  of  a  transition  in  con¬ 
nection  with  the  firing  schedule. 

As  a  simple  example  to  motivate  this  chapter,  consider  the  SMD 
Petri  net  model  of  an  assembly  line  with  decisions  of  the  type  dis¬ 
cussed  in  Section  1.2.  Figure  5.1.1  is  such  a  Petri  net  model.  Let 
transitions  a,  b  and  c  represent  assembly  operations,  and  let  p  be  a 
place  which  represents  a  bay  at  which  a  decision  is  made  about  which 
engine  should  be  attached  to  a  chassis.  The  engines  are  available  in 
the  input  queues  connected  to  transitions  b  and  c,  and  completely  as¬ 
sembled  autos  are  output  into  the  queues  marked  "output  queues."  Each 
of  the  assembly  operations  represented  by  the  transitions  a,  b  and  c 
have  associated  time  durations  T^,  T^,  and  T ^  respectively.  A  partially 
assembled  auto  that  appears  at  place  p  can  be  routed  one  of  two  ways, 
by  firing  either  transition  d  or  transition  e.  We  said  in  Section  1.2 
that  this  routing  can  be  done  in  several  ways,  and  the  exact  mechanism 
of  making  this  routing  decision  does  not  co..’ern  us  here.  We  have 
pointed  out  some  of  the  ways  in  which  this  decision  can  be  made  for  a 
practical  assembly  line.  From  a  performance  standpoint,  we  must  know 
the  relative  numbers  of  each  type  of  automobile  which  are  produced  by 
the  assembly  line.  Let  us  see  why.  Suppose  a  large  proportion  of 
all  cars  produced  by  the  assembly  line  are  200  HP  cars,  and  suppose 
that  the  final  assembly  process  for  attaching  a  200  HP  engine  to  a 
chassis  is  extremely  slow  (i.e.,  transition  f  has  a  long  firing  Lime). 
This  assembly  s’- age  will  not  be  able  to  handle  the  load  imposed  by  it, 


and  will  slow  down  the  operation  of  the  whole  line  by  choking  off  the 
absorption  of  subassemblies  from  the  input  queues.  It  is  intuitively 
obvious,  therefore,  that  this  assembly  line  can  be  balanced  by  matching 
the  speed  of  operation  of  an  assembly  station  to  its  expected  frequency 
of  use.  It  can  also  be  seen  that  there  is  a  tradeoff  between  the  num¬ 
ber  of  physical  processing  units  at  an  assembly  station  and  the  speed 
of  each  processing  unit  -  a  number  of  slow  processing  units  can  be 
used  instead  of  a  few  fast  ones,  and  vice  versa.  Our  question  in 
connection  with  this  assembly  line  is  -  how  do  we  estimate  the  rate  at 
which  it  produces  assembled  autos? 

Let  us  examine  our  model  carefully  and  see  what  information  is  needed 
to  specify  the  system  in  a  way  which  is  complete  for  our  purpose.  We 
have  shown  input  queues  connected  to  some  of  the  transitions  in  the  net 
in  Figure  5.1.1.  We  pointed  out  in  Section  1.2  that  we  will  assume 
that  there  is  always  at  least  one  item  in  each  input  queue,  so  that  it 
is  not  necessary  to  indicate  the  queues,  but  we  have  shown  the  queues 
for  completeness.  Let  us  now  see  how  to  incorporate  the  relative  fre¬ 
quency  of  use  of  the  two  alternative  assembly  stations  into  out  model. 

The  action  of  the  Petri  net  in  Figure  5.1.1  can  be  represented  by 
means  of  an  occurrence  graph  (see  Section  3.5.3).  In  the  occurrence 
graph  shown  in  Figure  5.1.2,  the  probability  or  the  relative  frequency 
of  occurrence  of  each  of  the  outcomes  of  the  decision  made  at  place  p4 
will  be  reflected  in  the  number  of  occurrences  of  the  corresponding 
transition  in  a  long  frustrum  of  the  occurrence  graph.  For  example,  if 
the  probabilities  of  occurrence  of  transitions  d  and  e  are  ^  and 
respectively,  then,  in  a  long  frustrum  of  the  occurrence  graph  there 
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will  on  an  average,  be  twice  as  many  occurrences  of  transition  e  as  of 
transition  d.  This  means  that  in  a  consistent  current  assignment  for 
the  Petri  net,  the  current  assigned  to  transition  e  will  be  twice  that 
assigned  to  transition  d.  Our  problem  is  to  find  the  maximum  computation 
rates  of  transitions  tg  and  tg  in  the  net.  In  general,  we  would  like 
to  find  the  maximum  computation  rate  of  any  transition  in  a  timed 
SMD  net,  for  which  a  consistent  current  assignment  is  given.  We  wish 
to  find  a  computation  rate  assignment  such  that  the  computation  rate 
of  each  transition  is  proportional  to  its  current  in  the  given  con¬ 
sistent  current  assignment. 

The  problem  can  be  tackled  by  simulation.  In  such  a  method  the 
computation  rate  of  any  transition  can  be  found  by  letting  the  net  "run" 
for  a  long  time,  and  then  dividing  the  number  of  times  that  transition 
has  fired  by  the  total  amount  of  time  that  has  elapsed  since  time  zero. 
The  method  is  extremely  time  consuming  and  we  would  like  an  analytical 
technique  which  enables  us  to  find  a  good  bound  on  the  computation  rate 
of  transitions  in  the  timed  net. 

Consider  a  timed  SMD  net  X  =  (P,  ft) ,  having  a  minimal  integer  con¬ 
sistent  current  assignment  $.  We  know  from  Section  3.5  that  several 
c-equivalent  nets  are,  '.n  general,  possible  for  X  with  consistent  current 
assignment  $.  In  Figure  5.1.3(a)  we  reproduce  the  net  shown  in  Figure 
3.5.4.  For  the  given  minimal  consistent  current  assignment,  several 
c-equivalent  nets  are  possible,  and  we  give  two  of  them  (see  Figures 
5.1.3(b)  and  5.1.3(c).  Each  timed  c-equivalent  net  defines  a  periodic 
schedule.  We  compute  their  fundamental  computation  rates,  and  get 
the  following: 


Figure  5.1.3  fb) 


Applying  the  results  of  Section  4.4, 

Fundamental  Computation  Rate  p'  =  min 

Figure  5.1.3  (c) 

Fundamental  Computation  Rate  p'  =  min  j~-|yr  yy  yy  =  24  * 

Thus,  the  maximum  Fundamental  Computation  Rates  of  the  two  c-equivalent 
nets  are  different.  In  order  to  find  a  bound  on  the  maximum  fundamental 
computation  rate  of  the  timed  net  X,  we  must  find  the  timed  c-equivalent 
net  which  has  the  largest  computation  rate. 

The  maximum  fundamental  computation  rate  of  a  c-equivalent  net  for 
the  timed  net  X  represents  the  maximum  fundamental  computation  rate  of 
transitions  in  the  net  X  for  the  behavior  specified  by  that  c-equivalent 
net.  This  leads  to  the  following  definition: 

Definition  5.1.1  The  maximum  fundamental  computation  rate  of  transitions 
in  a  timed  SMD  net  X  =  (^,0/  for  a  minimal  integer  consistent  current 
assignment  $  is  given  by  the  fundamental  computation  rate  of  the  c- 
equivalent  net  which  has  the  largest  fundamental  computation  rate. 


Definition  5.1.2  The  maximum  computation  rate  of  a  transition  t^  be¬ 
longing  to  a  timed  SMD  Petri  net  X  =  with  a  minimal  integer 

consistent  current  assignment  I  is  given  by 
Pt  =  tPi  x  p 

vhere  qy  is  the  current  assigned  to  t^  by  $and  p  is  the  maximum  funda- 
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mental  computation  rate  of  x- 

We  see  that  to  obtain  an  exact  value  of  the  maximum  fundamental  compu¬ 
tation  rate  of  X»  we  must  find  all  the  c-equivalent  nets  for  X  and 
find  the  one  with  the  largest  fundamental  computation  rate.  This  it 
a  very  tedious  process,  and  we  would  like  to  find  a  simple  method 
which  gives  a  bound  on  the  fundamental  maximum  computation  rate  of  X. 
Consider  the  timed  SMD  Petri  net  X  =  (^,n)  in  which  the  underlying 

Petri  net  *P  has  a  live  marking  M.  Let  S.  ,  S„,  .  .  .S  be  the  state 

i  £.  m 

machine  components  of  P.  Let  the  transitions  of  any  state  machine 
component  Sj^  be  t^,  t^,.  •  their  firing  times  be  T^,  T  .»  •  •  »Tik 

and  their  currents  be  cp^,  cPi2>***cPlk  respectively.  Also,  let 
Pil>  Pi2'***pik  ke  the  P*aces  belonging  to  the  state  machine  S^.  Let  n^ 
denote  the  number  of  tokens  on  state  machine  S^. 

Consider  the  occurrence  graph  associated  with  one  of  the  tokens  on 
the  state  machine  (see  Figure  5.1.4). 

Since  there  are  n^  tokens  on  the  state  machine  S^,  the  maximum  natural 
computation  rate  of  transition  t^  is  thus  or  The 

quantity  Y^  is  termed  the  maximum  natural  fundamental  computation  rate 
of  the  state  machine,  and  to  obtain  the  maximum  natural  commutation 
rate  of  transition  k^,  we  multiply  Y  by  cp^.  The  term  maximum 
natural  computation  rate  of  a  transition  t^  refers  to  the  computation 
rate  it  would  have  if  the  state  machine  were  isolated  from  the  other 
components  of  the  SMD  Petri  net  (P .  However,  as  we  saw  in  the  case  of 
timed  event  graphs,  the  components  of  a  timed  net  affect  each  other's 
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fundamental  computation  rate.  We  now  show  that  if  two  state  machines 
S.  and  S.  have  some  transition  t  in  common,  then  the  fundamental 
computation  rate  of  both  state  machines  is  min  fY^Yj].  Let  cp  be 
the  current  associated  with  transition  t  in  a  consistent  current 
assignment.  Then,  it  is  clear  that  the  computation  rate  of  t  cannot 
exceed  cp  X  min  [Y.,  YJ.  For  suppose  without  loss  of  generality  that 
Yi<  V  Then’  the  computation  rate  of  t  cannot  be  greater  than  cp  x  Y .  . 
If  it  could  be  greater  than  cp  x  Y^  it  would  violate  the  definition  of 
the  fundamental  computation  rate  of  Thus,  the  computation  rate  of  t 

cannot  exceed  cp  x  min  [Y.^l.  AlSOf  by  consistency,  the  computation 
rate  of  any  transition  tfl  €  S£  must  equal 

^il  X  cp  X  min  [Y  Y  ]  =  cp  X  min  fY  ,Y  ] . 
cp  J  11  i  j 

Similarly,  the  computation  rate  of  any  transition  t.  in  S,  equals 

Jm  j 

X  cp  x  min  [Yi,Yj]  =  cpjm  X  min  fYi>Yj]. 


We  conclude  that  if  two  state  machine  components  and  in  a  timed 
SMD  Petri  net  have  a  transition  iu  common,  each  state  machine  acquires 


a  fundamental  computation  rate  =  min  fY^Y^]. 


Let  p  min  fY x  ,Y2> .  ,  .YJ  .  Without  loss  of  generality,  let  p  =  Y 
We  now  construct  sets  ^....^  of  state  machines  as  follows: 


If  is  a  set  of  state  machines,  then  define  \  to  be  the  set 

P  ^  pi*  p2’’**^pr^  whcrG  ^p2,'*’^pr  are  state  machines  which 

have  at  least  one  transition  in  common  with  the  state  machines  in  \  . 

p 

Also,  -  [Sj,  s2»-'*smK  I*e*»  all  the  state  machine  components  of  P. 
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Construct  \  „  from  \  until  \  =  \  .  Then  \ 

P+1  P  P+1  p  p 

the  state  machines  because  P is  strongly-connected. 


must  include  all 


The  reader  can  now  see  that  if  the  maximum  fundamental  computation 

rate  of  =  Y^,  then  the  maximum  fundamental  computation  rate  of  all 

state  machines  in  is  also  Y^,  and  so  on  until  we  have  exhausted  all 

state  machines  in  the  net  P.  Thuo,  all  state  machines  in  x  must  have  a 

maximum  fundamental  computation  rate  p  =  Y,  =  min  [Y,  ,Y  , . .  .Y  1  where 

1  12m 

^2’“*^m  are  t*le  state  “achine  components  of  P  . 

The  maximum  computation  rate  of  any  transition  t^  in  the  net  ip  thus 
^  X  p. 

We  have  thus  established  the  following  result: 

In  a  timed  SMD  Petri  net  \  =  (*P  ,fa)  ,  the  maximum  computation  rate  of 
any  transition  t^  is  given  by 

pi  *  ^  x  p' 

where  p'  =  min  [Y. ,Y,,...Y  ] 
i  l.  m 

where  ^^>^2,***^m  are  t*le  max*mum  fundamental  computation  rates  of 
Sr  S2»...Sm.  The  maximum  fundamental  computation  rate  Y^  of  state 
machine  is  given  by 


Z  "v  T»j 


j=l 

where  n.  =  number  of  tokens  on  state  machine  S,  .  t.  ,  ,t.  .  .t,  are 

K  k  kl  k2  kr 

the  transitions  of  state  machine  Sk>  are,  respectively,  the 

current  in  a  consistent  current  assignment  for  and  firing  time  of 
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transition  t^  €  S^. 

The  bound  we  have  obtained  here  is  based  on  the  assumption  that 
when  a  token  appears  at  a  place  with  conflict,  then  its  further 
routing  (i.e.,  which  output  transition  to  fire)  is  done  statistically, 
using  an  a-priori  probability  measure  proportional  to  the  currents 
associated  with  the  respective  transitions.  In  the  next  section, 
we  derive  this  bound  by  considering  the  c-equivalent  nets  for  X. 
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5.2  Rigorous  Derivation  of  Bound  on  Fundamental  Computation  Rate  for 
Timed  SMD  Petri  Nets 

In  the  previous  section,  we  saw  how  we  can  find  the  upper  bound  on 
the  maximum  fundamental  computation  rate  of  a  timed  SMD  Petri  net  x  with 
a  minimal  integer  consistent  current  assignment.  This  involved 
drawing  all  the  timed  c-equivalent  nets  for  X  and  finding  the  one  with 
the  largest  maximum  fundamental  computation  rate.  As  we  pointed  out, 
this  is  a  tedious  process.  The  following  Theorem  gives  an  upper  bound 
on  the  maximum  fundamental  computation-  rate  of  a  timed  SMD  Petri  net 
in  terms  of  its  structure,  marking,  minimal  integer  consistent 
current  assignment  and  firing  time  assignment. 


Theorem  5.2.1  In  a  timed  SMD  Petri  net  x  31  with  a  minimal  integer 

consistent  current  assignment  $,  the  maximum  fundamental  computation 

rate  is  given  by  p '  =  min  [Y  ,. .  .Y  ].  Y-...Y  are  the  fundamental 

x  m  i  m 

computation  rates  of  the  state  machine  components  of  x.  The  fundamental 
computation  rate  Y^  of  state  machine  S^  is  given  by 


where  n^  =  number  of  tokens  on  state  machine  S^. 
t^^,...t^r  are  the  transitions  of  state  machine  S^. 

^kj*  Tkj  are  t*le  current  and  firing  time  respectively  of  transition 
fckj  6  Sk* 
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Proof:  Consider  the  c-equivalent  net  €  of  the  SMD  net  P  for  the 
consistent  current  assignment  $.  We  showed  in  Section  3.5  that 
for  every  state  machine  component  SR  in  P,  there  exists  a  corresponding 
circuit  C^€  6  with  the  following  property: 

Every  transition  in  has  a  multiplicity  ji(t  )  equal  to  the 
current  ^  assigned  to  transition  t£  in  9  by  the  minimal  consistent 
current  assignment  i.  Now  let  C^,...^  be  the  simple  cir¬ 
cuits  in  6 ,  where  correspond  to  state  machines  of  (P  &nA 

^nri-l’"*^r  are  8^mP^e  circuits  in  €  that  do  not  correspond  to  state 
machines  of  Let  ¥j,...Ym,  Yr  he  their  respective  fundamental 

computation  rates.  For  any  circuit  C  €  {c  ,...C  ) 

i  1  *  m 


I  Vu 


Now,  the  fundamental 
p ’  =  min  [Y^ , . . . 
■  min 

or  p’s  min  [Y  , . . . 


computation  rate 


Y  ] 
r 


Y  , 
m’ 


m 


Y 

m+i 


P 


of  the  timed  net  y  is  given  by 


Note : 

There  may  exist  some  *  €  {Y  ..  ,...,Y  }  such  that 

T  s  mri  r 

*„  <  min  [Y.....Y  ]. 
a  1  m 

Thua,  while  p'  =  min  , . . . Ym ]  is  certainly  a  bound,  this  bound  may  not 

be  achievable.  The  computation  rate  p£  of  any  transition  ^  is  defined 
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to  be 


Pt  ■  P  *  v 

Once  again,  this  value  of  Is  a  bound,  but  this  bound  may  not  be  achie¬ 
vable.  Theorem  5.2.1  enables  us  to  find  a  bound  on  the  computation  rate 
of  any  transition  in  a  timed  3MD  Petri  net  by  finding  for  each  state 
machine  compon  nt  S^  the  corresponding  fundamental  computation  rate  i|r  . 
This  is  simpler  than  finding  the  maximum  fundamental  computation  rate  of 
all  the  c-equivalent  nets  for  the  SMD  Petri  net. 

Example:  Let  us  apply  this  result  to  the  timed  SMD  Petri  net  of  Figure 

5.1.3.  Theorem  5.2.1  gives  a  bound  on  the  maximum  fundamental  computation 
rate  equal  to 

min  [  1/15,  1/22]  «  1/22. 

By  drawing  all  the  c-equivalent  nets  for  the  SMD  net,  the  reader  can 
verify  that  the  net  of  Figure  5.1.3(c)  has  the  largest  maximum  fundamen¬ 
tal  computation  rate  of  1/24.  The  bound  of  Theorem  5.2.1  cinnot  actua¬ 
lly  be  realized  in  any  consistent  behavior  of  the  SMD  net,  but  we  see 
that  it  is  certainly  a  reasonable  bound.  The  reason  that  the  bound  is 
not  achievable  is  that  there  is  no  state  machine  component  in  that 
corresponds  to  the  multiply-labelled  circuit  P^p^p^p^. .  .t^. 

This  circuit  has  a  maximum  natural  fundamental  computation  rate  of  1/24. 
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5.3  Achlevabillty  of  Bound  on  Computation  Rate  for  timed  SMD  Petri  neLs 


The  bound  on  the  computation  rate  of  transitions  in  a  timed  SMD  Petri 
net  that  we  have  presented  in  the  previous  two  Sections  has  been  shown 
to  be  an  upper  bound  on  the  computation  rate  for  a  given  SMD  Petri  net 
with  a  minimal  integer  consistent  current  assignment.  We  have  also  shown 
that  this  bound  may  not  be  achievable,  and  in  this  Section  we  look  at  the 
issue  of  achievability. 

When  the  reader  examines  the  expression  for  the  bound,  he  will  notice 
that  it  is  always  achievable  for  event  graphs.  Since  the  multiplicity 
of  every  transition  in  an  event  graph  is  one,  this  means  that  there  exists 
a  minimal  integer  consistent  current  assignment  in  which  the  current  assi¬ 
gned  to  each  transition  is  unity.  Also,  since  a  simple  circuit  in  an 
event  graph  corresponds  to  a  state  machine  componaat  if  we  view  the  event 
graph  as  an  SMD  Petri  net,  we  can  rewrite  Theorem  5.2.1  for  event  graphs 
to  read  as  follows: 


In  a  timed  event  graph  (G,fi),  the  computation  rate  of  all  tran¬ 

sitions  in  the  net  is  the  same  and  is  given  by 

P  £  min  , . .  ,i|f  ]  , 
i  m 

where  C^,...Cm  are  all  simple  circuits  of  the  event  graph  and 

,  .  •£ 

1  Z 

T  *M  5  s 


This  is  the  same  bound  as  the  one  we  obtained  in  Expression  [4.11] 
in  Section  4.3.  We  also  showed  in  Section  4.3  that  this  bound  is  achie- 


-165- 


vable.  Thus,  the  result  in  Theorem  5.2.1  represents  an  achievable  bound 
if  the  SMD  Petri  nets  we  consider  are  restricted  to  being  event  graphs. 
Our  question  is  -  how  good  a  bound  is  it  for  more  general  SMD  Petri  nets? 
The  largest  class  of  SMD  Petri  nets  we  have  been  able  to  prove  achieve- 
bility  for  is  the  class  we  term  a-SMD  Petri  nets. 

Definition  5.3.1  An  cn-minimal  Integer  consistent  current  assignment  is 
one  in  which  all  transitions  with;  more  than  one  input  place  are  assigned 
unit  current. 

Definition  5.3.2  An  SMA  Petri  net  is  ct-SMA  iff  there  exists  an  ci-mini- 
mal  integer  consistent  current  assignment  for  it. 

Lemma  5.3.1  Let  g  be  the  c-equivalent  net  of  an  a-SMA  Petri  net  *P  for 
the  a-minimal  integer  consistent  current  assignment  $.  Then,  eveiy 
circuit  in  6  corresponds  to  a  state  machine  component  in  ^  and 
vice  versa. 

Proof 

Consider  a  state  machine  component  Sk  in  *P .  Let  7  be  a  cyclic  frus- 

trum  for  &  .  We  know  that  the  initial  slice  of  J-  must  contain  at  least 

one  place  from  every  state  machine  component  (  follows  from  Theorem 

3.5.1).  Choose  some  place  pkl  €  Sk  from  the  places  in  the  initial 

slice  of  ?  .  When  two  consecutive  instances  of  this  place  in  a  cyclic 

frustrum  for  the  given  current  assignment  are  considered,  every  state 

machine  containing  pk^  must  unfold  into  a  chtin  that  begins  and  ends  at 

Pki*  the  c-equivalent  net  constructed  from  the  cyclic  frustium,  each 
chain  corresponds  to  a  circuit. 
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Consider  the  cyclic  frustrum  for  the  given  c-equivalent  net. 

Since  9  is  SMA,  every  closed  simple  directed  path  is  an  SSM.  Also, 
since  every  transition  which  has  multiple  input  places  has  unit 
current,  there  is  exactly  one  instance  of  each  such  transition  in 
a  cyclic  frustrum.  Every  allocation  reduction  on  the  c-equivaient 

net  results  in  exactly  one  SSM  (a  multiply-labelled  circuit)  for  each 
SSM  in  9. 

Suppose  some  closed  directed  simple  path  does  not  correspond  to  any 
SSM.  Then  it  must  correspond  to  some  closed  structure  in  which  there 
exists  a  transition  t  with  multiple  input  places.  The  only  way  in 
which  such  a  closed  structure  could  map  into  a  circuit  is  for  there  to 
be  more  than  one  instance  of  it  in  a  cyclic  frustum,  which  is  the  de¬ 
sired  contradiction  (see  Figure  5.3.1). 


Multiple  Instances  of  t ^ 

Figure  5.3.1 


As  a  consequence  of  Lemma  5.3.1,  we  have  the  following  theorem: 
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Theorem  5.3.1:  The  bound  on  the  computation  rate  of  transitions 
attained  in  Theorem  5.2.1  holds  for  every  timed  a  -  SMD  Petri  net 

x  -  <^,n> 

Proof :  By  Lemma  5.3.1,  the  c-equivalent  net  tl  for  an  ^  -  SMD  Petri  net 
is  a  multiply  labelled  event  graph  in  which  every  simple  circuit 
corresponds  to  a  state  machine  component  of  'V  and  vice  versa.  By 
attaching  firing  times  to  each  transition  in  the  desired  result 
follows . 


Example :  Figure  5.3.2(a)  gives  an  example  of  a  timed  a  -  SMA  net, 
shown  with  a  consistent  current  assignment.  Figure  5.3.2(b)  gives 
a  c-equivalent  net  which  realizes  the  computation  rate  given  by 
the  expression  of  Theorem  5.2.1.  The  reader  will  note  that  each  state 
machine  in  the  a  -  SMD  Petri  net  of  Figure  5.3.2(a)  maps  into  a  multiply 
labelled  circuit  in  Figure  5.3.2(b)  and  vice  versa. 

In  Figure  5.3.3(a)  the  timed  SMD  Petri  net  of  Figure  5.3.2(a)  is  shown 
with  a  minimal  integer  consistent  current  assignment  in  which  transition 
t^  has  a  current  equal  to  2.  This  net  is  no  longer  a  ■  SMA,  and  the 
bound  of  Theorem  5.2.1  can  no  longer  be  achieved,  as  can  be  verified 
by  examining  its  c-equivalent  net  which  has  the  largest  maximum 
fundamental  computation  rate. 


Bound  from  Theorem  5.2.1 


Figure  5.3.2(a) 
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Bound  for  c-equlvalent  net  of  Figure  5.3.3(b) 
p'  -  min  [20*  25  '  26*  I?]  =  26' 

Thus,  the  bound  from  Theorem  5.2.1  is  not  achievable  by  any  c-equivalent 
net  for  the  given  timed  SMD  Petri  net. 
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Section  5.4  Review  of  Results  Obtaineo 

In  this  Section  we  will  put  our  work  in  perspective  and  will  point 
out  its  relationship  to  PERT  networks  [FI]  and  the  program  flowcharts 
of  Martin  and  Estrin  [Ml,  M2,  M3]. 

In  Chapter  2  we  have  argued  that  practical  systems  can  be  viewed 
as  an  ensemble  of  interacting  components  and  that  each  system  component 
can  be  viewed  as  a  state  machine.  Subsequently,  we  investigated  the 
modelling  of  such  systems  using  Petri  nets.  We  introduced  SMD  Petri 
nets  as  being  the  class  of  Petri  nets  which  can  model  practical  systems. 

The  type  of  systems  we  wish  to  model  do  not  have  any  redundant  functional 
operators,  and  are  free  of  deadlock.  This  translates  to  the  problem  of 
choosing  only  those  SMD  Petri  nets  which  have  a  live  marking.  We  have 
pointed  out  that  we  still  do  not  understand  how  the  state  machine 
components  of  an  SMD  Petri  net  should  be  interconnected  to  ensure  that 
the  net  has  a  live  marking.  Currently,  the  largest  subclass  of  SMD 
Petri  nets  we  know  of  with  this  property  is  the  class  of  SMA  Petri  nets. 

Since  any  live  marking  for  an  SMD  Petri  net ?  is  bounded,  a  consistent 
current  assignment  can  be  made  to  the  transitions  in  y.  By  multiplying  all 
currents  in  a  consistent  current  by  the  least  common  multiple  of  their 
denominators  and  dividing  them  by  the  greatest  common  divisor  of  their 
numerators,  we  get  the  minimal  integer  consistent  current  assignment, 
lhe  current  associated  with  a  transition  in  a  minimal  integer  consistent 
current  assignment  is  the  multiplicity  of  the  transition  in  a  c-equivalent 
net  of  the  SMD  Petri  net  for  the  given  current  assignment. 

We  have  also  looked  at  the  entire  class  of  LSP  Petri  Nets  (i.e., 

Petri  nets  which  have  a  Live,  Safe, Persistent  marking).  Even  though 
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Figure  5.4.2 


We  have  introduced  timed  Petri  nets  in  order  to  model  the  finite 
speed  of  operation  of  practical  systems  and  have  shown  how  to  find  the 
computation  rate  of  transitions  in  LB  SMD  Petri  nets  and  LSP  Petri  nets. 
For  both  kinds  of  timed  Petri  nets,  the  maximum  computation  rate  of  a 
transition  is  the  fundamental  computation  rate  of  the  timed  c-equivalent 
net  for  the  timed  Petri  net  multiplied  by  the  multiplicity  of  the 
transition  in  the  c-equivalent  net. 

In  order  to  find  the  maximum  fundamental  computation  rate  of  a  timed 
c-equivalent  net  (or  a  steady-state  equivalent  net  for  an  LSP  Petri  net) 
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we  assume  that  all  transitions  are  dlctlnctly  labelled,  and  we  find  the 
maximum  computation  rate  of  the  resulting  timed  event  graph.  The  maximum 
computation  rate  of  a  timed  event  graph  can  be  found  by  applying 
Equation  [4.11]. 

We  illustrate  the  various  terms  used  and  the  relationship  between 
the  computation  rates  of  timed  event  graphs,  multiply-labelled  event 
graphs,  LSP  Petri  nets  and  SMD  Petri  nets  by  means  of  Figure  5.4.3. 

The  diagram  is  self-explanatory  and  shows  that  the  bound  of  Equation 
[4.11]  forms  the  cornerstone  of  our  work,  the  fundamental  computation 
rate  of  timed  SMD  and  LSP  Petri  nets  being  obtained  by  finding  the  funda¬ 
mental  computation  rate  of  an  equivalent  multiply-labelled  event  graph, 

We  are  now  in  a  position  to  point  out  how  our  work  relates  to  the 
following  models  of  parallel  processing  and  parallel  computation: 

(a)  PERT  networks 

(b)  Martin  and  Estrin  Flowcharts. 

Let  us  begin  with  PERT  networks  [Fl] .  A  PERT  network  consists  of  an 

acyclic  directed  graph  with  an  input  vertex  and  an  output  vertex.  All 

arcs  in  the  network  lie  on  paths  from  the  input  vertex  to  the  output 

vertex.  Each  arc  denotes  an  activity  in  the  project  being  modelled 

by  the  network,  and  each  activity  takes  a  certain  amount  of  time  to 

occur.  We  can  model  a  PERT  network  as  a  timed  acyclic  event  graph  by 

replacing  each  arc  with  its  two  end  vertices  by  the  structure  shown  in 

Figure  5.4.4.  The  time  associated  with  the  activity  ab  is  now  associated 

with  the  timed  transition  t  ,  introduced  into  the  arc  ab.  Transitions  a 

ab 

and  b  are  assumed  to  have  zero  firing  time.  In  Figure  5.4.5  we  show 
this  transformation  carried  out  on  an  example  PERT  network. 
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Suppose  we  draw  an  arc  from  f  to  a  in  Figure  5.4.5.  The  resulting 
graph  is  a  strongly-connected  event  graph,  whose  critical  circuit 
corresponds  to  the  critical  path  of  the  PERT  network.  In  Chapter  6 
we  will  see  how  strongly-connected  timed  event  graphs  can  be  applied 
to  many  practical  situations  where  PERT  networks  are  inadequate. 

We  now  turn  to  the  program  flowcharts  of  Martin  and  Estrin  [Ml,  M2, 
M3].  These  are  directed  graphs  consisting  of  arcs  and  nodes.  Nodes 
represent  operations  in  a  computer  program  and  arcs  represent  data 
paths  between  them.  The  presence  of  a  data  value  on  a  data  path  is 
represented  by  a  marker.  Each  node  has  logical  conditions  (and, 
exclusive-or )  on  data  values  on  the  input  arcs  that  must  be  satisfied 
before  the  node  can  act.  The  action  of  a  node  consists  of  removing  a 
data  value  from  the  specified  logical  combination  of  input  arcs,  per¬ 
forming  a  computation, and  then  depositing  data  values  on  the  specified 
logical  combination  of  output  arcs.  The  action  of  a  node  is  assumed  to 
take  some  finite  amount  of  time.  In  Figure  5.4.6  we  show  how  to  model 
these  timed  nodes  by  means  timed  acyclic  Petri  net  structures.  A  Martin 
and  Estrin  flowchart  models  program  constructs  like  decisions  and 
iteration  together  with  aspects  of  parallel  programs,  like  the  fork  and 
join  operations  [D6],  A  test  is  performed  to  check  if  the  operation  of 
the  flowchart  can  proceed  in  a  deadlock- free  manner;  if  an  arc  is  drawn 
from  the  output  node  to  the  input  node,  this  test  turns  out  to  be  similar 
to  the  one  for  deciding  if  an  SMD  Petri  net  is  SMA.  Our  work  can  be 
viewed  as  modelling  cyclic  or  recurrently  acting  systems,  while  the 
work  of  Mtrtin  and  Estrin  is  concerned  with  finding  the  mean  execution 
time  of  parallel  programs.  The  two  pieces  of  work  taken  should  provide 
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a  good  bag  of  tools  for  the  analysis  and  design  of  asynchronous 


computer  systems. 
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CHAPTER  6 

APPLICATIONS  OF  TIMED  PETRI  NETS  TO  THE  MODELLING  OF  ASYNCHRONOUS 
CONCURRENT  SYSTEMS 


Deterministic  Systems 

w"  1,111  now  consider  applications  for  our  work  drawn  from  diverse  dis¬ 
ciplines  such  as  computer  systems  modelling  and  operations  research.  Through¬ 
out  this  discussion,  we  will  keep  In  mind  the  distinction  we  have  made  bet¬ 
ween  deterministic  and  non-determiniatic  systems.  We  begin  by  presenting 
several  types  of  deterministic  systems.  The  simplest  system  we  wish  to  con¬ 
sider  is  a  set  of  adder  units  whose  action  can  be  represented  by  the  timed 
event  graph  shown  in  Figure  6.1.1. 


Figure  6.1.1 


Each  token  on  the  circuit  represents  a  physical  hardware  adder  unit  which 
can  be  in  one  of  three  states  shown  (i.e.,  "ready  to  pick  up  operand  pair", 
"ready  to  perform  addition"  and  "ready  to  output  result").  We  will  make 
the  assumption  that  there  is  always  an  operand  pair  available  in  the 
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input  queues,  so  that  a  hardware  adder  unit  never  has  to  be  idle  for  want 
of  operands,  i.e.,  the  environment  does  not  introduce  delays  into  the 
operation  of  the  system.  Applying  Equation  [4.11],  the  maximum  rate  at  which 


results  appear  in  the  output  queue  is  given  by 


n  /  (  t1  +  t2  +  t  ) 


where  n  is  the  number  of  physical  hardware  adder  units, 


V  T2  and  T3 


are  the  time-:  required  to  perform  the  actions 


modelled  by  transitions  t^.t^  an<*  *3- 

Let  us  asssme  some  values  for  the  above  parameters. 


Let  n  =  4. 


T  *  500  nsec. 

T1 

rr  —  9  ii <  "» n 

T2 

T  =  500  nsec. 

3 

The  maximum  throughput  rate  of  the  adder  is  then  4/3  =?  1.33  million 

additions  per  second. 

A  more  interesting  example  is  the  timed  Petri  Net  model  of  a  three-stage 
pipelined  floating  point  adder  shown  in  Figure  6.1.2. 


Figure  6.1.2 
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This  pipelined  adder  has  only  one  hardware  unit  per  stage,  so  that  there  is 

overlapped  operation  of  the  stages  without  any  parallelism  in  each  stage. 

The  maximum  throughput  rate  of  the  system  is  seen  to  be 

min  f  1/t  ,  1/t  1/t  ]. 

a  b  c 

Let  us  examine  the  above  expression.  Suppose  1/Ta  is  the  smallest  of  the 
three  quantities  in  parenthesis.  Then  stage  A  has  the  slowest  hardware 
unit,  and  the  natural  computation  rate  of  this  stage  determines  the  through¬ 
put  rate  of  the  system.  This  means  that  having  hardware  units  in  stages  B 
and  C  that  are  faster  than  the  hardware  unit  of  stage  A  is  wasteful,  since 
their  added  speed  does  not  result  in  any  extra  system  throughput.  We  will 
say  that  a  deterministic  system  is  balanced  if  the  natural  computation  rate 
of  all  system  parts  is  equal.  In  the  context  of  the  pipeline  adder  in  Figure 

6.1.2.  this  means  that  T  =  T  =  T.. 

a  b  c 

A  more  complex  pipelined  adder  would  be  one  in  which  there  are  multi¬ 
ple  hardware  units  in  each  stage.  Such  a  pipelined  a^der  is  shown  in  Figure 

6.1.3. 


Stage  A  Stage  B  Stage  C 


Figure  6.1.3 
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Stages  A,B  and  C  have  n^  ,  and  n  hardware  functional  units,  respectively. 
In  this  case,  the  throughput  of  the  syltem  is  given  by 

min  f  n  /t  ,  n./r  ,  n  /t  ] . 
a  a  b  b  c  c 

For  the  system  to  be  balanced,  we  see  that  na^Ta  =  nb^Tb  =  nc^Tc' 

This  implies  the  following: 

If  a  stage  consists  of  slow  hardware  functional  units,  a  proportionately 
larger  number  of  units  should  be  present  in  the  stage  to  ensure  that  the 
system  is  balanced.  Absence  of  system  balance  implies  that  there  are  parts  of 
the  system  with  excess  capacity  that  cannot  be  used.  Our  results  can  thus 
be  used  to  test  for  system  balance  in  deterministic  systems,  and  they  serve 
to  formalize  the  intuitive  notions  of  balance  that  a  hardware  designer 
would,  no  doubt,  use  in  the  design  of  pipelined  systems. 

PERT  Networks  and  Project  Scheduling 

PERT  charts  are  used  in  Project  scheduling  to  determine  the  shortest  tir  e 
that  it  takes  for  an  ensemble  of  concurrent  activities  to  complete,  given 
the  precedence  relationships  between  the  activities  and  the  time  duration  of 
each  activity.  We  examined  the  relationship  between  PERT  charts  and  timed 
Event  Graphs  in  Section  5.6.  We  now  see  that  by  using  timed  event  graphs, 
we  can  model  aspects  of  project  schediling  and  assembly  lines  that  are  not 
within  the  power  of  PERT  charts.  The  two  main  advantages  of  timed  event 
graphs  over  PERT  charts  is  their  ability  to: 

(a)  model  systems  that  act  recurrently. 

(b)  model  physical  resource  units  explicitly. 

Suppose  we  consider  the  project  represented  by  the  PERT  chart  in  Figure 
6.1.4.  The  project  consists  of  activities  "a"  through  "g"  with  the  prece¬ 
dence  constraints  expressed  by  the  PERT  chart.  We  begin  by  drawing  the 
equivalent  acyclic  event  graph  for  this  PERT  chart  ,  using  t!  e  method  given 


In  Section  5.6.  This  acyclic  event  graph  is  shown  in  Figure  6.1.5. 


b  <Tb> 


Figure  6.1.4  A  PERT  Chart . 


Now  suppose  we  want  the  PERT  chart  in  Figure  6.1.4  to  represent  a  manufac¬ 
turing  process,  where  t^  represents  the  start  of  the  process,  and  t  the 
end.  We  assume  that  each  activity  in  the  system  requires  the  use  of  a 
unit  of  resource,  like  a  lathe,  a  milling  machine,  etc.  We  wish  to  express 
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the  fact  that  there  are  only  a  finite  number  of  units  of  each  resource  type. 
A1  so,  some  resource  units  may  be  very  expensive  and  may  have  to  be  shared 
among  several  activities  (e  g.  a  high  precision  lathe).  How  are  we  to  ex¬ 
press  such  system  constraints  using  a  PERT  chart?  The  answer  is  that  there 
is  no  way  of  doing  this  without  augmenting  the  structure  of  PERT  charts.  Let 
us  see  what  added  descriptive  power  can  be  had  by  using  timed  event  graphs. 

Around  each  of  the  transitions  t  through  t  ,  we  draw  a  loop.  Each  loop  is 

a  g  r  r 

marked  with  a  number  of  tokens  equal  to  the  number  of  physical  processing 
units  available  for  the  correspond!  ig  activity.  The  resulting  event  t raph 
is  shown  in  Figure  6.1.6. 


Figure  6.1.6  Event  graph  which  models  limited  resources  available 
for  each  activity  in  PERT  chart. 

In  addition  to  the  transition  representing  an  activity,  each  loop  consists 

of  a  second  transition,  which  we  will  interpret  as  the  act  of  allocating 

a  resource  unit  to  the  activity.  Since  we  wish  to  model  a  recurrently  acting 

production  facility,  we  complete  the  loop  between  t  and  t  and  add  a  large 

o  i 

number  of  tokens  to  place  p  as  shown  in  Figure  6.1.7.  One  of  the  self-loops 


Figure  6.1.7 


(i.e.,  loops  around  each  of  the  transitions  t  ...t  now  becomes  the  criti- 

a  g 

cal  circuit.  The  system  can  be  balanced  by  assuming  that  the  natu¬ 
ral  computation  rate  of  each  of  these  circuits  is  the  same,  i.e. 

\/(Tal+V  *  V'W  '  V<T81+  V 

The  n's  denote  the  number  of  tokens  on  the  corresponding  circuits.  If  all 
the  n's  are  infinite,  then  the  processing  capacity  of  the  production  facility 
becomes  infinite.  This  is  the  .situation  represented  by  Figure  6.1.5.  Since 
any  production  facility  in  real  life  has  only  finite  resources  available 
to  it,  these  resources  can  be  explicitly  represented  as  we  did  in  Figure 
6.1.6. 

/ 

A  word  now  about  adding  an  infinite  number  of  tokens  to  the  loop  formed 
by  joining  tQ  to  t^  through  place  p.  Each  token  represents  the  possibility 
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of  a  set  of  subassemblies  being  input  to  the  production  facility  for  process¬ 
ing.  Our  assumption  that  we  can  add  an  infinite  number  of  tokens  to  place  p 
is  equivalent  to  saying  that  there  is  an  unbounded  amount  of  buffer  capacity 
between  the  activities  t  ...t^.  In  practice  the  amount  of  buffer  space 
(in  the  form  of  storage  bays)  between  work  stations  in  a  production  process 
is  bounded,  and  this  would  place  a  limit  on  the  number  of  tokens  we  can  put 
on  place  p.  In  that  case,  we  would  have  to  consider  the  natural  computation 
rate  of  all  circuits  in  Figure  6.1.7  including  circuits  like 


t.t  tvt  t  t.  ,  t^t.t^t  t  t,  ,  etc. 
iabcoi  idfgoi 

Let  us  give  an  example  of  how  the  buffer  capacity  of  the  system  may  affect 
the  maximum  throughput  rate  of  the  system.  We  do  this  by  considering 

some  actual  values  for  system  parameters. 
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We  see  that  circuit  t^t.t^t  t  t. 

idfgoi 


limits  the  throughput  rate  of  the  system. 
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The  throughput  rate  of  the  system  becomes  min  f  2/3,  1/2,  2/4,  3/6,  2/5, 
3/8,  2/3,  100/9,  100/11,  100/17]  =  3/8. 

The  loop  around  transition  t  now  becomes  the  bottleueck  in  the  system. 

The  production  facility  we  have  considered  so  far  was  arranged  in  such  a 
way  that  each  activity  has  its  own  set  of  resource  units  available  to  it.  We 
now  consider  the  issue  of  resource  sharing. 


Consider  the  production  process  shown  ifc  Figure  6.1.8.  Once  again, 

na ’  * '  * ne  denote  the  number  of  tokens  on  the  loops  around  transi¬ 
tions  t^  ...  t  respectively,  we  assume  the  following  values  for  the  parame- 


ters  of 

the 

system: 

Ta2 

= 

Tb2  =  * 

•  • 

Te2 

Tal 

= 

6 

n 

a 

=  ,  1. 

Tbl 

= 

2 

% 

=  1. 

Tcl 

= 

5 

n 

c 

=  1. 

Tdl 

= 

10 

nd 

=  2. 

Tel 

= 

4.5 

n 

e 

=  1. 

0. 
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We  make  the  assumption  that  the  same  type  of  resource  is  used  by  both 

activities  t  and  t, .  In  Figure  6.1.8  ,  each  activity  has  its  own  resource 
a  b 

unit.  In  this  configuration,  the  system  throughput  is 

min  [  1/6,  1/2,  1/5,  2/10,  1/4.5  j  =  1/6. 

We  see  that  activity  t  is  the  bottleneck  in  the  system.  Since  activities 

t  and  t  use  the  same  type  of  resource,  it  is  possible  to  pool  their 
a  b 

resource  units  together  .  Now  whenever  resources  are  pooled  together,  some 
resource  allocation  strategy  must  be  adopted  to  ensure  fair  resource  allo¬ 
cation  to  the  contending  resource  users.  Since  event  graphs  can  model  only 
deterministic  systems,  we  will  use  a  simple  strategy  in  which  each  resour¬ 
ce  unit  is  allocated  alternately  to  the  two  activities.  The  resulting 
system  is  shown  in  Figure  6.1.9. 


Figure  6.1.9 

We  assume  that  all  parameters  of  the  system  are  the  same  as  before;  this 
time  the  throughput  of  the  system  is 

min  [  2/8  1/5,  2/10,  1/4.5  ]  =  1/5. 
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Thus ,  the  activity  t  is  no  longer  the  bottleneck  in  the  system.  It 

a 

should  be  noted  that  other  resource  allocation  strategies  are  possible, 
but  they  result  in  non-deterministic  nets. 

So  far,  the  system  models  we  have  considered  have  all  been  timed 
event  graphs.  We  now  give  an  example  of  a  system  which  is  modelled  by 
an  LSP  Petri  net. 

Figure  6.1.10  is  a  timed  LSP  Petri  net  model  of  the  assembly  line 
that  alternately  outputs  two  types  of  automobiles,  which  we  discussed 
in  Section  1.2. 

The  maximum  computation  rates  of  transitions  t  and  t  are  of  interest 

5  6 

because  they  tell  us  the  maximum  rate  at  which  automobiles  are  output 
by  this  assembly  line.  These  computation  rates  can  be  found  from  the 
timed  steady  state  equivalent  net  of  Figure  6.1.10(b). 

We  now  suppose  that  there  is  parallelism  within  some  of  the  assembly 
stages,  but  that  automobiles  are  still  manufactured  alternately.  This 
can  be  modelled  by  adding  tokens  to  places  in  the  net  of  Figure  6.1.10(a) 
other  than  Pg  and  p^,  and  we  get  an  LBP  Petri  net  of  the  type  shown  in 
Figure  6.1.10(c). 

The  reader  will  recall  from  Section  3.4  that  in  order  to  draw  the 
steady  state  equivalent  net  of  an  LBP  net  that  is  also  LSP,  tokens  are 
removed  until  a  life,  safe  marking  results  and  then  drawing  the  steady 
state  equivalent  net  for  it.  The  marking  is  constructed  by  the  technique 
given  in  Section  3.4.  We  get  the  steady  state  equivalent  net  of  Figure 
6.1.10(d).  Once  again,  the  maximum  computation  rate  of  transitions  k^  and 
kg  can  be  found. 

The  system  models  we  have  considered  so  far  have  all  been  determinis- 


-  - . . ^ 


tic  Petri  nets.  We  now  turn  our  attention  to  the  modelling  of  non- 


deterministic  systems. 

The  simplest  example  of  a  non-deterministic  system  that  we  wish  to 
model  is  the  pipelined  processor  with  decisions  of  the  type  discussed 
in  Section  1.2.  A  timed  SMD  Petri  net  model  for  this  is  given  in 
Figure  6.1.11. 


t 


=  T. 


T5  = 


0 


Figure  6.1.11 

This  net  has  five  state  machine  components,  as  the  reader  can  easily 
verify  (indicated  as  Stage  1,..., Stage  5).  Two  types  of  instructions 


can  be  processed,  Type  A  and  Type  B.  Let  us  suppose  that  the  relative 
frequency  of  these  two  instruction  types  is  in  the  ratio  of  2:3  (this 
can  be  found  by  statistical  analysis  of  program  traces).  This  leads  to 


the  following  minimal  integer  consistent  current  assignment: 


,1  ,  cpa  =  (P2  =  cpb  =  <P3  =  5 

V  =<P  =  <P  =  <P  =  2 

c  4  e  b 


cpd  =  cp5  =  cpf  =  cp7 


3. 


The  computation  rate  of  transition  t^  gives  the  maximum  rate  at 
which  instructions  are  absorbed  from  the  input  queue.  Similarly,  the 
computation  rates  and  p^  give  the  maximum  rates  at  which  results 
are  placed  in  the  output  queues.  To  calculate  these  quantities,  we 
first  find  p'.  From  Theorem  5.2.1, 


p'  =  min  [  <1^,  f2,  1^,  t[(4,  i|r5] 

where  each  of  the  terms  represents  the  fundamental  computation  rate  of 
the  corresponding  stage  in  the  system.  System  balance  requires  that 


J.et  T\  be  the  firing  time  of  transition  t^  where  i  =  a,...,f.  Then,  we 
have 

1 

p'  =  minf  -  ,  etc....] 


•  1  1  1  1 

min  5T  ’  5TU  ’  2T  +  3T/  2T 
a  b  c  d  a 

Suppose  we  choose  T  =  T  =  T  =  T  =  T  =  T  = 

a  b  c  d  e  f 

then  p'  =  1/5  and  the  system  is  not  balanced. 

A  balanced  and  more  economical  system  results  when 

T £  =  1.67  p  sec.,  the  other  parameters  being  the  same. 


1 


1  (isec. , 

T  =  2.5  u  sec. , 
e 

Under  this  new 


firing  time  assignment, 
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therefore, 


=  2  x  105 


instruct  ions /sec . 


~  5  X  p  =  106  instructions/sec. 

Pg  -  2  X  2  X  10  =  4  X  105  instructions /sec . 

p?  -  3  X  2  X  10  =  6  X  10  instructions/sec. 

The  reader  can  construct  further  examples  based  on  processors  in  which 
there  are  multiple  hardware  units  within  each  stage,  and  can  work  out 
a  method  for  an  optimal  design  based  on  certain  objective  and  cost 
functions.  We  shall  not  attempt  to  do  this  here. 

The  SMD  Petri  nets  we  have  exhibited  so  far  have  mainly  been  models 
of  pipelined  processors  or  assembly  processes.  Let  us  now  look  at 
models  for  interacting  cyclic  processes  in  computer  systems.  We  begin 
with  a  model  for  two  processes  that  interact  with  each  other  through 
mailboxes.  Figure  6.1.12  gives  a  schematic  or  a  flowchart  model  for 
such  a  pair  of  processes.  One  of  the  processes  is  deterministic  (or 
decisionless)  and  the  other  one  has  two  decisions  in  it.  The  processes 
communicate  by  passing  messages  to  each  other  through  mailboxes,  and 
the  reader  can  convince  himself  that  they  can  operate  concurrently 
without  deadlocking.  This  can  be  verified  formally  \<han  the  Petri  net 
model  for  this  system  shown  in  Figure  6.1.13  is  examined.  The  net  is 
seen  to  be  SMA,  and  in  Figure  6.1.14  we  indicate  its  state  machine  com¬ 
ponents.  Since  the  marking  shown  puts  at  least  one  (in  this  case 


exactly  one)  token  on  each  state  machine,  the  marking  is  clearly  live. 
This,  as  the  reader  will  recall  from  Chapters  2  and  3,  means  that  the 
system  of  processes  has  no  redundant  operators,  and  furthermore,  that 
the  processes  can  operate  without  being  deadlocked. 


Figure  6.1.12 


Figure  6.1.13 


mm ifeggM 


Figure  6.1.14(d) 


Having  thus  established  that  the  Petri  net  in  Figure  6.1.13  has  a 
live  marking,  we  can  now  suppose  that  the  transition  labelled  "op  19" 
represents  an  operation  which  outputs  a  result  every  time  the  transition 
fires.  Our  problem  now  is  to  find  out  the  maximum  rate  at  which  op  19 
outputs  results,  given  the  usual  parameters  like  the  firing  tires  of  the 
transitions  and  a  minimal  integer  consistent  current  assignment.  The 
places  marked  "op  8"  and  "op  16"  represent  decisions,  each  decision 
having  two  outcomes.  The  relative  frequency  of  the  outcomes  of  each 
decision  can  be  found  by  statistical  means.  Let  the  probabilities  of 
op  11  and  op  9  be  p^  and  p^  respectively,  where  p^  +  p^  =  1. 

Similarly,  let  p^g  and  p^  be  the  probabilities  of  op  18  and  op  17 
respectively,  such  that  p^  +  p^g  =  1.  We  can  now  obtain  a  minimal 


integer  consistent  current  assignment  for  the  net,  where  each  current  is 
denoted  as  usual  by  cp,  and 


!n 

= 

P11 

©- 

P9 

'Pis 

a: 

P18 

P17 

'Pi? 

s  an  example, 

let  P11  = 

!/3  P9  = 

2/3 

P18 

u> 

II 

1/4 

!**n,  the  following  is  a 

consistent  current  assignment 

95=  1 

'Pe  = 
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[6.2] 


2 

3  * 


From  this  consistent  current  assignment,  we  can  derive  the  following 
minimal  integer  consistent  current  assignment: 
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The  maximum  computation  rate  Pig  =  p'  X  t£>lg  =  12P1,  where  p'  can  be 
determined  by  applying  Theorem  5.2.1. 

The  example  we  have  worked  out  clearly  shows  the  utility  of 
Theorem  5.2.1.  As  we  have  pointed  out  in  Section  5.2,  an  achievable 
bound  can  be  found  for  Plg  by  drawing  the  timed  c-equivalent  nets  for 
the  Petri  net  in  6.1,13,  and  finding  the  c-equivalent  net  with  the 
minimum  value  of  fundamental  computation  rate.  This  method  is  not 
easy  and  Theorem  5.2.1  gives  a  far  more  tractable  method*  although  the 
bound  so  computed  may  be  overly  optimistic,  since  the  Petri  net  is  not 
a  -  SHA. 


The  timed  event  graph  of  Figure  6.1.9  models  a  production  facility 
in  which  a  deterministic  resource  sharing  strategy  was  used  to  share 
resource  units  between  activities  and  Tbr  We  pointed  out  that  in 
order  to  model  non-determinis cic  resource  allocation  strategies,  we 
need  SMD  Petri  nets.  We  show  such  a  system  in  Figure  6.1.15. 

A  minimal  integer  consistent  current  assignment  is  one  which  assigns 
unit  current  to  each  transition,  i.e., 


"  ”b2  =  “a!  '  ’’bl  =  •••■  ’’el  '^2  =  U 

By  decomposing  the  net  into  its  state  machine  components,  we  see  that 

only  one  state  machine  component  has  changed,  namely  the  one  containing 
the  resource  pool  and  the  transitions  t^,  t^,  tfel  and  tb2.  The  maximum 


fundamental  computation  rate  of  thi3  state  machine  is  7 


+  T  +T  +T 
al  a2  bl  b2 


which  is  the  same  as  that  of  the  simple  circuit  t  0  t  t.  t  in  Figure 

a2  al  b2  bl  & 

6.1.9.  Thus,  the  non-deterministic  strategy  does  not  change  the  compu¬ 
tation  rate  of  the  system.  This  can  be  seen  to  be  true  for  deterministic 
production  facilities,  as  they  have  a  fixed  minimal  integer  consistent 


Resource  pool 


Figure  6.1.15  The  Production  Facility  of  6.1.9  with  a  Non-Pet erministtc 

Resource  Allocation  Strategy. 

current  assignment.  The  reader  is  invited  to  construct  further  examples 
for  himself. 

We  conclude  this  chapter  with  an  example  of  an  ensemble  of  sequential 
processes  which  contend  for  shared  resource  units  in  a  resource  pool. 
Figure  6.1.16(a)  is  an  SMD  Petri  net  model  of  a  system  of  three  sequen¬ 
tial  processes  contending  for  two  pooled  resource  units.  The  minimal 
integer  consistent  current  assignment  models  the  fact  that  the  relative 
frequencies  with  which  Processes  1,2  and  3  are  allocated  resource  units 


-2 


Figure  6.1.16(a) 

are  in  the  ratio  or  1:2:3  (this  is  a  coincidence).  The  net  is  SMA, 
and  the  four  state  machine  components  designat_d  S^,...,S^  are  shown 
in  Figure  6.1.16(b). 

I 

By  applying  Theorem  5.2.1,  the  reader  can  easily  find  the  computation 
rate  of  any  transitions  of  interest. 

It  is  hoped  that  the  material  presented  in  this  chapter  has  given  the 


reader  a  good  overview  of  the  applications  or  the  work  presented  in  the 
earlier  chapters.  The  reader  should  reinforce  his  nderstandir.g  of  this 
material  by  constructing  further  examples  of  his  own. 
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CHAPTER  7 


This  thesis  has  answered  several  questions  but  has  opened  up  many 
others.  Basically,  we  have  laid  to  rest  the  issues  we  raised  in  Chapter 
1.  We  have  shown  how  to  model  asynchronous  concurrent  systems  in  an  eco¬ 
nomical  fashion,  and,  by  augmenting  our  model  with  timing  information,  we 
have  shown  how  to  find  bounds  on  the  computation  rate  of  a  large  class  of 
s  ys  t  ems . 

Our  approach  in  establishing  the  validity  of  the  SMD  Petri  net  model 
for  asynchronous  systems  has  been  heavily  dependent  on  example  and 
intuition.  This  may  seem  rather  unsatisfying  to  some  of  us,  and  we  pose 
the  following  problem  to  our  readers:  can  we  come  up  with  a  set  of  axioms 
that  specifies  the  structure  and  behavior  of  asynchronous  systems,  and  then 
show  that  these  axioms  lead  to  SMD  Petri  nets? 

SMD  Petr*  nets  have  turned  out  to  be  a  very  rich  class  of  structures 
for  representing  asynchronous  concurrent  systems.  However,  we  do  not  have 
any  necessary  and  sufficient  structural  conditions  for  an  SMD  Petri  net 
to  have  a  live  marking,  and  we  pose  this  as  another  problem  to  our  rea¬ 
ders  . 

The  other  major  question  that  remains  to  be  examined  is  to  assume 
firing  times  to  be  random  variables  in  order  to  model  real-world  systems 
more  accurately  than  is  possible  with  the  model  of  Chapter  4.  Good  bounds 
are  needed  for  the  mean  computation  rate  of  such  timed  nets.  Another  issue 
that  seems  to  merit  some  attention  is  to  assume  that  input  queues  to 
an  asynchronous  concurrent  processing  systems  are  fed  by  items  whose 
arrival  rates  are  random  (e.g.,  Poisson).  Thi*  will  lead  to  a  statis- 
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tlcal  fluctuation  in  the  processing  rate  of  the  system,  and  the  effect 
of  buffers  in  smoothing  out  these  fluctuations  can  be  examined.  L  general 
performance  analysis  theory  can  then  be  worked  out  for  asynchronous 
concurrent  systems.  We  think  that  our  thesis  is  a  step  in  that  direc¬ 


tion. 


-211- 


APPENDIX  I 

In  this  appendix,  we  show  that  the  rooted  tree  T(V)  for  any  vector 
addition  system  V  is  finite,  and  the  proof  of  Theorem  2.3.1  is  given. 

The  results  and  te  uniques  in  this  appendix  are  taken  from  Karp  and 
Miller  [K2] . 

Finiteness  of  the  Tree  T(V) 

To  prove  that  T(V)  is  finite  for  any  V  requires  two  lemmas.  The 
term  subsequence  used  here  does  not  refer  necessarily  to  successive 
elements  of  a  sequence.  Thus  1,  3,  4,  15,  79,...  is  a  subsequence  of 
1,2,3,... 

Lemma  1 . 1 .  Let  sq»s^» • • ,sn» • • .  be  an  infinite  sequence  of  elements 

from  (N  U  M)r  for  some  positive  integer  r.  Then  there  exists  an 

infinite  subsequence  s  ,s  ,...,s,  ,...  such  that  s.  £  s .  <:...£  s  .. 

1  12  in  ll  x2  1  . 

Proof,  in  s(J,s1,...sn,...  there  exists  an  infinite  subsequence  that  is 

non-decreasing  in  the  first  element.  In  this  sequence  in  turn,  there 

exists  infinite  subsequence  that  is  non-decreasing  in  the  second  element 

l 

etc. 

Lemma — I • 2  .  (Konig  Infinity  Lemma  [K3] ,  Let  T  be  a  rooted  tree  in  which 
each  vertex  has  only  a  finite  number  of  successors  and  in  which  there  is 
no  infinite  path  directed  away  from  the  root  node.  Then  T  is  finite. 
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Theorem  1 . 1 :  For  any  vector  addition  system  V,  the  tree  ?(V)  is 
finite. 

Proof :  Assume  otherwise  and  let  s^»s2»  *  *  *sn»  *  *  *  a  se<luence  of 

nodes  in  an  infinite  path  directed  away  from  the  root  node.  By  Lemma 

1.1  there  must  exist  an  infinite  subsequence  s,  ,  s,  ,  ...s  , ...  of  this 

11  12  n 

sequence  such  that  £(s  )  £  £(s  )...  £  £(s  )  £  _  Since  none  of 

1  i2  in 

these  nodes  is  an  end,  it  can  never  happen  that  £(s,  )  =  £(s  ).  If 

n  1n+l 

this  were  not  true,  then  the  path  would  be  finite  by  condition  2(a)  in 
the  definition  of  T(V). 

From  condition  2(b)  in  the  definition  of  T(V),  £(s  j  must  have  at 

n+l 

least  one  more  element  equal  to  ^  than  £(s^  )  does.  Since  the  number  of 

n 

elements  is  finite,  we  have  a  contradiction,  and,  therefore,  no  such 
infinite  path  can  exist.  From  Lemma  1.2  it  must  be  the  case  that  T(V) 
is  finite. 

Proof  of  Theorem  2.3.1:  For  any  vector  addition  system  V  and  any  integer 
vector  x  of  the  same  dimension 

(2y  €  R(V)  such  that  x  £  y)  »  (  3  p  6  TO7)  such  that  x  £  £(P)  ). 

Proof :  We  first  show  that  the  right  hand  side  implies  the  left  hand 
side.  The  idea  of  the  proof  is  that,  if  p  is  a  node  in  T(V),  then  there 
are  vectors  in  R(V)  which  agree  with  £(p)  in  its  finite  elements,  and 
can  be  made  arbitrarily  large  in  the  elements  equal  to  w  by  repetition  of 
the  sequence  of  vectors  which  led  to  the  occurrence  of  w.  The  details  of 


the  construction  involve  some  calculation. 
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Suppose  x  <;  £(P).  Let  the  path  from  s  to  p  have  the  successive 

nodes  Tl0,Tl1,...Tlk>  where  s  =  710  and  p  =  Tlk.  For  j  =  l,2,...k,  let 

vj  be  the  vector  associated  with  the  arc  directed  into  ;  i.e., 

=  01j.i)v  .  Assume  without  loss  of  generality  that  the  first  h 

components  of  £(p)  are  equal  to  u,  and  that  the  other  components  are 

less  than  w.  Assume  further  that,  in  the  path  from  s  to  p,  u>'s  are 

introduced  in  the  order  l,2,...h.  Then,  for  each  i,  1  s  i  s  h 

there  exists  a  consecutive  subsequence  t  =  v  v 

M  i  c(i)’  c(i)  +l’**-vd(i) 

such  that  the  vector  u.  -  »c(1)  +  vc(1)+1>...  +  vd(l)  is  positive  in 
the  i-th  element  and  nonnegative  in  the  i+1  st  through  nth  elements. 
Note  that  ^  is  the  subsequence  that  "accounts  for"  the  i-th  u). 

Let  -n  be  a  lower  bound  on  all  the  (negative)  elements  of  u^...,^. 
Also  let  {n^,...,n^}  be  any  set  of  non-negative  integers  satisfying: 

2  (x-s)^  +  it(h  +  L  +  «2  +  n3  +  . . .+ 

n2  *  (x-s)2  +  it(h  +  n3  +  n4  +  .  .  ,+n^) 


(1) 


\  *  (x-s^  +  n(h  +  2  -  i  +  ni+1  +  . .  .+1^) 


^  (x-s)h  +  2lt 
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Such  a  set  must  exist  because  of  the  triangular  form  of  the  inequali¬ 
ties. 

Choose  s^.s^j  •  •  •  such  that  for  1  ^  i  £  h,  s^..^  is  the 

prefix  of  v^v^.-.v^  up  to  the  first  occurrence  of  m  in  the  i-th 

element,  and  v^...^.  111611  the  sequence 

n  n.  n 

1  ^  h 

6  =  s ^t ^  is2t2  •••8hth  sh+l  =  ulu2’”Uf  haS  the  followin8 
properties : 

(a)  s  +  u^  +  ^...Uj  s  x 

(b)  each  partial  sum  s  +  u^  +.  ..u^  is  non-negative. 

We  omit  the  detailed  derivation  of  (a)  and  (b)  from  the  system  of 
inequalities  (1).  To  show  that  the  left  hand  side  implies  the  right 
hand  side,  suppose  that  the  following  are  true:  s  +  u^  +  u^.^u^  6  R(V)  , 

x  £  s  +  u^  +  .  .  .uf,  and  s  +  u^  +. .  .-5-  u^  ^  0,  m  =  1,2, . .  .f ,  where  the 

(u  }are  elements  of  W.  Apply  the  following  operation  to  the  sequence 
m 

s,  s  +  u^,  s  +  u.  +  u2>...s  +  u^  +. . .  +  u^  as  many  times  as  possible: 

Find  the  first  member  of  the  sequence  (call  it  u')  such  that,  for 
some  earlier  member  u'',  u'  <  u'. 

(a)  If  u*'  =  u',  then  delete  all  members  following  u'; 

(b)  otherwise,  for  each  i  such  that  (u1')^  <  (u1)^,  replace  the  i-th 
element  of  u'  and  of  each  vector  beyond  u*  in  the  sequence  by  w. 

It  should  be  clear  that  the  sequence  obtained  at  th-?  conclusion  of 
this  process  is  the  sequence  of  labels  in  some  path  directed  from 
the  root  of  T(V),  and  that  the  final  label  in  this  sequence  is  a  vector 
greater  than  or  equal  to  s  +  u^  +...  +  u^.  Hence,  the  left  hand  side 
implies  the  right  hand  side,  and  the  proof  is  complete. 
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To  illustrate  the  construction  of  the  sequence  6  given  in  the 
first  part  of  the  proof,  the  following  example,  due  to  Karp  and 
Miller,  is  provided: 

Suppose  s  =  (1,1, 1,4)  and  W=  {(0,0, 0,-1),  (2, -1,0,0),  (-1,1, 0,0), 
(-1,-3, 4,0)} .  Consider  the  following  path  in  T(V) : 


(1,1, 1,4  )— (1,1,1, 3) 


\  2, -1,0,0 

(3,0, 1,3)  (u)ti,i, 


3) 

\  -1,1, 0,0 

(u,(J,l,3) - (u),Gd,GJ,3) 


S1  =  (0,0, 0,-1),  (2, -1,0,0),  (-1,1, 0,0)  tt  =  (2, -1,0,0),  (-1,1, 0,0) 
s2  =  (-1, 1,0,0)  t2  =  (-1,1, 0,0) 

s3  =  (-1, -3,4,0) 


t3  =  (-1,-3, 4,0) 


Take  x  -  (22,16,9,3)  £  (^,^,^,3)  and  let  n  =  3.  The  system  of 
inequalities  (1)  for  this  case  is: 

^  21  +  3(4  +  r*2  +  n3) 
n2  ^  15  +  3(3  +  n3) 
n3  *  8  +  3  .  2 

A  solution  is:  n3  =  14,  n2  =  66,  n^  =  273,  giving  the  sequence 
6  =  (0,0, 0,-1),  (2, -1,0,0),  (-1,1, 0,0),  ((2, -1,0,0),  (-1,1, 0,0))273, 
(-1,1, 0,0),  (-1,1, O.O)66,  (-1,-3, 4,0),  (-l,-3,4,0)14,  which 


establishes  that  the  point  (193,23,61,3)  2  x  is  in  R(V) . 
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